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Uncharted territory 


Political maps that seek to advance disputed territorial claims have no place in scientific papers. 
Researchers should keep relationships cordial by depoliticizing their work. 


uhammad Ali observed that the wars of nations are fought 
M to change maps — and he wasa man who knew how to fight. 

Yet there are more subtle ways to change maps. Take the 
South China Sea: Chinese officials insist that much of its waters belong 
to China, and Chinese maps tend to include a dotted line that makes 
the same point. Yet there is no international agreement that China 
should have possession, and other countries have overlapping claims. 

What has this to do with science and Nature? Nothing — except 
that territorial disputes, including that over the South China Sea, 
are leaking into the pages of scientific journals such as this one. Ina 
disturbing trend, an increasing number of maps included in scientific 
articles by Chinese researchers feature a dotted line that envelops 
almost the entire South China Sea, to indicate Chinese possession 
(see page 293). Scientists and citizens of surrounding countries are 
understandably peeved by the maps, which in most cases are com- 
pletely unrelated to the subjects of the papers in which they are pub- 
lished. The inclusion of the line is not a scientific statement — it isa 
political one, apparently ordered by the Chinese government. It’s a 
territorial claim, and it’s being made in the wrong place. 

Where research and politics mix, science should be a tool of 
diplomacy, not territorial aggression. Even politically hostile envi- 
ronments can prove fertile ground for scientific collaborations. An 
increasing number of researchers from Taiwan are teaming up with 
colleagues in mainland China, even as Beijing and Taipei continue 
to fundamentally disagree over their relationship. According to data 
provided by Lou-Chuang Lee, the head of Taiwan's National Science 
Council, the number of research papers resulting from cross-strait 
collaborations rose from 521 in 2005 to 1,207 last year. 

Such collaborations set the stage for the realization of common 
interests and, one might hope, resolution of political differences. At 
the least, they could help to restrain aggression. 

Still, politics does often find a way to intrude. In August, for 


example, Ann-Shyn Chiang, director of the Brain Research Center 
at the National Tsing Hua University in Hsinchu, Taiwan, was sur- 
prised by a request from Yi Rao, a neuroscientist at Peking University 
in Beijing, with whom he was writing a paper. Rao wanted to put 
down Chiang’s affiliation as “Taiwan, China, the appellation preferred 
by Beijing. Chiang told Rao either to use Taiwan or Taiwan ROC 

(Republic of China), or to drop his name from the author list. 
Eventually the two found a compromise, agreeing that they would 
use Taiwan, Republic of China. The dispute 


“Whereresearch over the South China Sea, with its resources 
and politics and geopolitical significance, won't be so eas- 
mix, science ily ironed out. 

should be a tool With regard to this and other interna- 
of diplomacy, tional disputes, Nature takes the position 
not territorial that scientists should stick to the science. 


Authors should try to depoliticize their 
articles as much as possible by avoiding 
inflammatory remarks, contentious statements and controversial 
map designations. If such things can’t be avoided, for example if a 
study of a country’s resources requires taking account of whether 
a certain island belongs to it, the map should be marked as ‘under 
dispute’ or something to that effect. In papers in Nature, editors 
reserve the right to insert such a label if authors fail to do so. By 
avoiding controversy, researchers who keep politics from contami- 
nating their science will keep the doors of collaboration open, and 
their studies will benefit. Researchers could also, as a by-product, 
help to defuse political tensions, show the way to mutual benefit and 
perform a diplomatic service. 

Researchers on all sides have much in common, as many scientists 
in parts of the world made unstable by conflict can appreciate. It makes 
no sense to undermine this solidarity through irrelevant political and 
territorial posturing. m 


aggression.” 


@ @ @ 
Positive spin 
Science lobbyists must boost the appeal of 
research to policy-makers. 


the combination of the economic downturn, concerns over 
the budget deficit and anti-science rhetoric from the Tea Party 
have created a difficult environment for those paid to persuade US 
lawmakers to find funds for research. And money available to lobby- 
ists to make the case is in short supply too, as sponsor organizations 


Pp ity the science lobbyist. As we report on page 299 of this issue, 


watch their own budgets in the struggling economy. 

The good news, at least, is that lobbyists are aware of the problems 
and have a pitch that takes some account of them. Their arguments 
now routinely stress the importance of research to US economic 
growth, health, welfare and competitiveness — and point out that 
research institutions are major employers in districts that include 
those represented by Tea Party members. A change in terminology, 
from ‘science funding’ to ‘science investment; is particularly smart. 

Yet there are many types of investment — roads, primary education 
and crime reduction, to name but a few — that must compete for an 
ever-decreasing pot of funds, and science advocates could do more to 
respond to the shifting mood in Washington. 

First, lobbyists should argue that scientists spend US taxpayers’ 
money efficiently. Campaigners can point to changes that the scientific 
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community is making to ensure that is the case. As Nature reported in 
August (see Nature 476, 385; 2011), a study by the National Academy of 
Sciences on research universities, due out by the end of 2011, is expected 
to recommend ways in which institutions could share resources and be 
more thrifty. Leaders of the medical community, who have seen fund- 
ing through the National Institutes of Health double between 1998 and 
2003 but slow since, are also making informal preparations to down- 
size their enterprise or survive on a flat budget. That the US scientific 
community is taking its cue from lawmakers to streamline operations 
reflects well, and could help to counter the image painted by critics of 
scientists as elitist ivory-tower types in ill-deserved, comfortable jobs. 

Second, lobbyists should stress how money spent on science con- 
tributes to education. This requires action from the scientists too, to 
make sure they take their teaching responsibilities seriously. There 
is increasing political scrutiny of this role, which too many faculty 
researchers still regard as an inconvenience. 

In Texas, governor and Republican presidential hopeful Rick Perry 
has introduced a more business-orientated approach to education 
that could, this month, see nearly half the undergraduate physics 
programmes in the state penalized with probation or closure if not 
enough students are graduating. 

In response, physics faculties have pointed out, correctly, that such 
a severe policy would bar entry to science to minorities and students 
in poor areas, from which enrolment in science subjects is regret- 
tably low. Up to one-third of graduates with undergraduate physics 
degrees in the United States come from programmes that would not 
meet the Texas requirements — to graduate 25 students in 5 years 
(see Nature http://dx.doi.org/10.1038/news.2011.559; 2011). This 


diversity should be protected, not whittled away. 

But lobbyists should take note: Texas physicists have responded too, 
with pledges to refocus their teaching responsibilities to educate and 
graduate more students. And many research universities are working 
to reward good teaching by staff scientists with promotions. 

Similarly, scientists across the United States are taking a more per- 
sonal stake in the education of the nation’s children, a particularly 

important step given that anti-science rheto- 


“Too many ric flourishes in an atmosphere of scientific 
faculty illiteracy. 

researchers Programmes such as PhysTEC, a national 
stillregard network of institutions working to get phys- 
teaching as an ics graduates into school classrooms, and the 


UTeach Institute in Austin, Texas, which does 
the same for all scientific and mathematical 
fields, are working to address the paradox that, although bachelor’s 
graduation rates in physics are at an all-time high across the United 
States, there is a shortage of well-qualified physics teachers in second- 
ary schools. Lobbyists could and should highlight such programmes 
as examples of publicly funded scientists giving back to the public. 

Itis easy to be pessimistic and to assume that science investment will 
suffer with the economic and political fortunes of the United States, 
no matter what scientists do. But scientists, lobbyists and the wider 
research community must guard against such fatalism. The positive 
vibe generated by recent lobbying events in Washington DC and the 
fact that, historically, both US political parties have worked to protect 
science from cuts, is cause for optimism. The message simply needs 
alittle fine-tuning. = 


inconvenience. ” 


The PSA position 


The US government must take a firm stance on 
whether prostate-cancer screening is justified. 


public health, especially in the United States, have an even rock- 
ier relationship. So when a respected panel of experts assessed 
the evidence for a controversial test for prostate cancer, and found US 
policy wanting, it would have known it was entering dangerous waters. 

The US Preventive Services Task Force therefore deserves much 
kudos for its conclusion earlier this month that the prostate-specific 
antigen (PSA) test does more harm than good when used to screen 
healthy men for cancer. 

Doctors in the United States routinely use the PSA test, not least 
because federal insurance provider Medicare pays for so many of them. 

The task force’s recommendation that the test should not be used 
in individuals with no symptoms, which is now open for public com- 
ment, has already been attacked by many doctors. The American Uro- 
logical Association in Linthicum, Maryland, has started a campaign 
against the proposal, and newspapers have dedicated a large amount 
of space to letters from men stating that PSA tests saved their lives. 

True, the data point in contrasting directions on the benefits of 
taking the test, given the unpleasant side effects of surgery — such 
as impotence and incontinence — to remove tumours that may not 
prove deadly. That is why the task force’s conclusion is so important: 
the balance of the evidence, however inconvenient, shows that routine 
use of the test is a liability to public health. 

“The common perception that PSA-based early detection of pros- 
tate cancer prolongs lives is not supported by the scientific evidence,” 
says the task force. Instead, “there is moderate certainty that the harms 
of PSA-based screening for prostate cancer outweigh the benefits”. 

The recommendation builds on a more tentative finding from the 


P olitics and science do not always see eye to eye, but politics and 
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same group in 2008, which said that the evidence did not support PSA 
screening in men aged 75 and older and that the picture was unclear 
for younger men. 

The update has not come as a huge surprise to medical experts 
outside the United States. The PSA test has long been viewed with 
suspicion in Europe — where it is not nearly so widely used. Even 
the man who invented the test — Richard Ablin — wrote in the New 
York Times last year that his work had led to a “profit-driven public 
health disaster”. 

Unfortunately, the “common perception’ is strong. The latest analy- 
sis will not directly change the way that prostate cancer is screened for 
in the United States. For one thing, the PSA tests funded by Medicare 
are protected by legislation. But change may be afoot. Last year, the 
American Cancer Society in Atlanta, Georgia, subtly modified its 
guidelines to cast more doubt on the usefulness of the test in screen- 
ing healthy men. 

If confirmed, the task force’s recommendation will provide a 
renewed opportunity for evidence to be put at the heart of policy, 
however uncomfortable that might be in the short term. 

Last time the task force issued a controversial recommendation 
— on breast-cancer screening in 2009 — health secretary Kathleen 
Sebelius effectively disowned the body. She has given no indication 
that this will happen this time round. The Department of Health and 
Human Services is waiting for the final recommendation before it 
weighs in. But when push comes to shove, the government must find 
the courage to act. 

PSA testing is big business, and allegations that scrapping it will 
see men being left to die from cancer are corrosive. But the PSA 
debate must not be about the money. It should be about the health 
of millions of men and how to collect and judge the research that 
informs their care. 

Those who argue against the decision should 
arm themselves with supporting data, not politi- 
cal attacks and anecdotes. And policy-makers 
must deliver firm support for the task force, or 
give good reasons why they choose not to. m 
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WORLD VIEW .ennisicorsen 


Nature has just published an analysis of the genome of the 
naked mole rat, and a historic sequence of the plague bacterium 
Yersinia pestis, reconstructed from medieval victims of the Black Death. 
What is the value of such research? In my field, infectious disease, 
the genomics revolution has been substantially less beneficial than 
was anticipated. To those of us who work in poor countries in which 
infectious diseases that have been all but eliminated in the West are still 
common, it is clear that the wide-eyed optimism and persuasive advo- 
cacy for genomics that were expressed at the turn of the century were 
at best naive, and at worst potentially damaging, diverting resources 
from more worthy causes. Certainly, the outcomes have not matched 
those promised as justification for funding the research. 

Ihave personal experience of the hype and the reality. Ten years ago, 
I was an author on the paper that announced the genome sequence of 
Salmonella enterica Typhi, the microorganism that 
causes typhoid fever (J. Parkhill et al. Nature 413, 
848-852; 2001). The research was promoted with 
great fanfare, which declared that scientists were 
at a turning point in the fight against the disease. 
A decade on, we are no closer to a global solution. 

The S. enterica Typhi sequence did help us to 
understand the biology of the organism and to 
address enigmas such as why it causes disease 
only in humans, how it develops resistance to 
antimicrobial agents and how it modulates the 
human immune response. And although not 
even the most optimistic genomics or public- 
health expert would have predicted eradication 
of the disease within a decade, the sequencing 
did at least make elimination a realistic target. 

But the promised concrete benefits — bespoke 
treatments, next-generation vaccines and low- 
cost diagnostics — have failed to materialize. 

The technological expertise and funding for genomics work and sub- 
sequent studies are concentrated in countries that, owing to geography 
and economics, are not affected by typhoid. Researchers and funders 
are detached from the disease’ realities. As a consequence, follow-up 
funding, expertise and advocacy are weakened. There is no motivation 
like seeing a patient every day, seeing a disease rife in your community 
or knowing someone with a serious illness. 

By contrast, HIV does affect the scientific superpowers. In wealthy 
countries, there has been remarkable progress in public health and 
clinical practice since the virus was discovered and subsequently 
sequenced. This is a testament to the power of 


( : enome sequences come thick and fast in modern science. 


scientific investment allied with politicaland ODNATURE.COM 

community support, and powerful activism. Discuss this article 
In this light, the reasons for a lack of success __ online at: 

with typhoid — and other infectious diseases _ go.nature.com/hxfzen 


THERE ARE NO 
ADVOCACY 
GROUPS 


FOR TYPHOID AND 
OTHER 


DISEASES OF 
POVERTY. 


Genomic medicine has 
failed the poor 


A decade on from a landmark genetic-sequencing study, promised progress 
on typhoid fever has not materialized, says Stephen Baker. 


endemic to low- and middle-income countries — are depressingly 
obvious. Affected people simply do not have the clout to force the sci- 
entific community to build on laboratory research. 

There are no advocacy groups for typhoid and other diseases of pov- 
erty similar to those that exist for HIV. Affected people and communities 
are not powerful constituencies. Decision-makers in endemic countries 
are typically drawn from the wealthier classes, and few have had typhoid 
fever, or have known someone who has died from the disease. 

The position of major international donors — including the Bill 
& Melinda Gates Foundation in Seattle, Washington, and the US 
National Institutes of Health in Bethesda, Maryland — does not help. 
They focus on a narrowing range of diseases such as malaria and 
tuberculosis: priorities typically determined by metrics and disease 
league tables. Anything else is left mostly untouched. 

It can bea vicious circle. Take diagnosis: there is still no good way to 
detect typhoid, so global estimates of the extent 
of the problem are based on guesswork and are 
almost certainly underestimates. The disease thus 
has low priority in international assessments. 
Genome-based diagnostic methods are being 
explored, yet they rarely reach the field — partly 
because they are designed for use in well funded, 
technologically advanced laboratories. Academic 
funding focuses on research, with little left to sup- 
port development and testing in endemic loca- 
tions. Many methods simply die before testing, 
or as a consequence of trials poorly executed on 
shoestring budgets. 

We need a new approach. Scientists who fight 
diseases such as typhoid through laboratory 
research must work more closely with those of 
us who do the same in the field. 

Funding agencies that support investigators 
who use genomics research must give them more 
opportunities to see the nature of the infection in endemic countries. 
One way is to increase support for consortia and collaborations that 
already aim to bridge the geographical and technological gap. 

Such initiatives also require more support and advocacy from min- 
istries of health in developing countries, to welcome international col- 
laboration and make possible academic and public-health partnerships. 

If these crucial issues are not addressed, I fear that the sequence of 
the S. enterica Typhi genome, and others like it, will persist as a fasci- 
nating academic advance, mostly useless in a public-health context. m 


Stephen Baker is head of enteric infections at the Wellcome Trust 
Major Overseas Programme, Oxford University Clinical Research 

Unit. He coordinates field and laboratory research in Ho Chi Minh 
City, Vietnam, and Kathmandu, Nepal. 

e-mail: sbaker@oucru.org 
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Stem-cell patents 


Procedures using stem cells 
derived from human embryos 
cannot be patented, the 
European Court of Justice ruled 
on 18 October. The judgment 
— which agrees with the court’s 
preliminary ruling in March — 
has dismayed some stem-cell 
scientists. Oliver Briistle, 
director of the Institute of 
Reconstructive Neurobiology 
at the University of Bonn in 
Germany, called the decision 

a setback for biomedical 
research. His 1997 patent ofa 
technique to generate nerve 
cells from human embryonic 
stem-cell lines sparked the legal 
debate. See go.nature.com/ 
oxbnsf for more. 


Carbon tax 

After years of politicking, 
Australia seems set to 
introduce a tax on carbon 
dioxide emissions. The 
government's proposed 

tax, of Aus$23 (US$24) per 
tonne for the country’s top 

500 emitters, was narrowly 
passed by the federal House of 
Representatives on 12 October. 
If passed by the Senate in 
November, as looks likely, the 
tax will come into effect on 

1 July 2012, and its price would 
increase by 2.5% a year above 
inflation until an emissions- 
trading scheme replaces 

it in 2015. See go.nature. 
com/9afdee for more. 


Censored in Texas 


Scientists last week withdrew 
their names from a draft 
coastal assessment after 

the administration of Texas 
governor Rick Perry censored 
the report to remove reference 
to rising sea levels. The state's 
environment agency, which 
commissioned the private, 
non-profit Houston Advanced 
Research Center to assess 
Galveston Bay, on the Gulf of 
Mexico near Houston, said the 


a 
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Fabulous fossil find 


This near-complete fossil of a young theropod, 
pictured under ultraviolet light, is the best- 
preserved dinosaur skeleton found in Europe, 
according to Oliver Rauhut, curator at the 
Bavarian State Collection for Palaeontology and 
Geology in Munich, Germany, which announced 
the find on 12 October. The 72-centimetre 


chapter in question did not 
conform with administration 
policy. Perry, a Republican 
presidential candidate, has 
said he does not believe in 
anthropogenic climate change. 
See go.nature.com/olkdvb for 
more. 


Russian protests 


About 500 Russian scientists 
rallied in Moscow on 

13 October to protest against 
cuts and red tape in national 
research funding. Protesters 
want more money for the 
Russian Foundation for 
Basic Research, Russia's only 
grant-giving science-funding 
agency. The government 

has decided to freeze the 
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agency's budget — less than 
US$200 million per year — at 
2010 levels. Researchers also 
complained about excessive 
state bureaucracy involved in 
the purchase of reagents and 
research equipment. See Nature 
477, 5 (2011) and page 324. 


US-China block 


By holding meetings that 
involved discussions with 
China, the White House Office 
of Science and Technology 
Policy (OSTP) violated a law 
that bans certain scientific 
activities between the two 
countries, a Government 
Accountability Office report 
found on 11 October. The 
congressman who in April 


skeleton is 135 million years old, around 

98% intact, and has some preserved skin and 
protofeathers. It was found near the town of 
Kelheim, Bavaria, and will go on public display at 
the Munich Show from 27 October, before being 
moved to the Biirgermeister-Miiller Museum in 
Solnhofen. See go.nature.com/zyyt5f for more. 


pushed through that ban 

— Frank Wolf (Republican, 
Virginia) — has turned up 
the heat on OSTP director 
John Holdren by asking the 
US attorney-general to take 
action. See page 294 for more. 


| _BUSINESS 
deCODE and Pfizer 


Icelandic genomics company 
deCODE genetics is teaming 
up with pharmaceutical 

giant Pfizer to hunt for rare 
genetic variants linked to the 
autoimmune disease systemic 
lupus erythematosus. The 
deal is deCODE'’s first with 

a drug firm since it declared 
bankruptcy in November 


H. TISCHLINGER/PHOTOSHOT 


= 2009, which happened in 
8 part because of its inability 
to translate genetic disease 
associations into drugs (see 
Nature 462, 401; 2009). The 
company has since become 
amore svelte research and 
genomics testing firm. The 
> deal was announced on 12 
October, but terms have not 
been made public. 


WILL/JAPAN PRIZE FOUNDATION. 


Oil-spill charges 

The US government's 
offshore-drilling regulator 

last week formally charged BP 
with seven safety violations 

in the lead up to the April 
2010 explosion on the firm's 
Deepwater Horizon oil rig 

in the Gulf of Mexico. BP’s 
contractors, Halliburton and 
Transocean, were each handed 
four notices of violations. 

The three companies are 

also all suing each other. 

The fines could total around 
US$45 million, but BP also 
faces billions of dollars in 
penalties under environmental 
legislation relating to the oil 
that leaked after the explosion. 


Unix pioneer dies 


Computer scientist Dennis 
Ritchie (pictured), who 
created the C programming 
language and co-developed 
the Unix operating system, 
died last week aged 70. Ritchie, 
who studied mathematics 

and computing at Harvard 


TREND WATCH 


SOURCE: S. J. DAVIS, G. P. PETERS & K. CALDEIRA/PNAS 


Carbon dioxide emissions are 
attributed to the country that 
produces them. Yet around 
one-fifth of China's emissions, 
for example, come from making 
goods demanded by consumers 
in other nations. A 17 October 
analysis (S. J. Davis et al. Proc. 
Natl Acad. Sci. USA http://dx.doi. 
org/10.1073/pnas.1107409108; 


2011) completes this supply chain, 


analysing both the emissions 
embodied in final consumer 
demand and where fossil fuels 
were originally extracted. The 
chart shows the results for Japan. 


University, worked at Bell Labs 
from 1967 until he retired 

in 2007. Languages such as 
C++ and Java built on C; and 
operating systems including 
Linux and OS X are based 


on Unix. 


|} RESEARCH 
Iran launch fails 


Iran’s bid to senda live 
monkey into space last month 
ended in failure, according to 
12 October reports quoting 
the Iranian Students’ News 
Agency. “The launch was 

not publicized as all of its 
anticipated objectives were 
not accomplished,’ deputy 
science minister Mohammad 
Mehdinejad-Nouri told the 
agency. He said that achieving 
such a launch was still a 
priority for the country’s 
space programme. 


Autism genomes 
A high-profile US advocacy 
group plans to create the 
world’s largest library of 
sequenced genomes from 


people with autism. Autism 
Speaks, based in New York, 
announced on 13 October that 
it will help to fund the BGI 
genome sequencing centre 

in Shenzhen, China, to fully 
sequence 10,000 genomes 
over the next two years. The 
project will cost US$40 million 
to $50 million, says Autism 
Speak’s vice-president for 
scientific affairs, Andy Shih. 

It will begin with a pilot 

phase aiming to sequence 

200 genomes by the end of 

the first quarter of 2012. See 
go.nature.com/i7xsan for more. 


ExoMars mission 
The European Space Agency 
(ESA) will forge ahead with 
ExoMars, an ambitious two- 
part robotic mission to look 
for signs of life on Mars, even 
though NASA cannot afford 
to provide a launch rocket for 
the first part of the mission. 
During a 12-13 October 
council meeting in Paris, ESA 
decided to begin negotiations 
with Russia for a rocket to 
launch the initial planetary 
orbiter stage of ExoMars in 
2016, in exchange for Russian 
participation in the mission. 
See go.nature.com/zimumq 
for more. 


Swedish funding 
One of Sweden's largest private 
research funders, the Knut and 
Alice Wallenberg Foundation 
in Stockholm, on 13 October 
announced plans to support 


A GLOBAL CARBON FOOTPRINT 


A study shows the origin of the carbon dioxide embodied in goods and 
services consumed by a country, in this case Japan. 
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24-28 OCTOBER 
Researchers gather in 
Denver, Colorado, at 
a conference arranged 
by the World Climate 
Research Program, to 
appraise the state of 
climate science and 
its biggest research 
challenges. 
go.nature.com/xcwaxa 


22-23 OCTOBER 
Open, collaborative 
models for drug 
research, personal 
genomics and scientific 
publishing are among 
the topics at the second 
Open Science Summit 
in Mountain View, 
California. 
opensciencesummit.com 


up to 125 young researchers 
for 5 years, in an initiative 
totalling 1.2 billion Swedish 
kronor (US$180 million). 

So far, 185 million kronor 

has been set aside for a first 
group of researchers, who will 
start in late 2012. Scientists 
will be nominated by their 
universities and selected by the 
foundation; each will be in line 
for up to 7.5 million kronor. 
The foundation awarded 

959 million kronor in research 
funding last year. 


Tsunami test 

After the first full-scale 
tsunami-warning exercise 
in the region, Indian Ocean 
nations say that they are 
ready to take control ofa 
US$100-million system set 
up in 2005. On 12 October, 
23 nations around the ocean 
received a bulletin about a 
(practice) tsunami. National 
focal points then spread the 
alert to coastal communities 
and emergency services. The 
test passed without major 
glitches. See go.nature.com/ 
yv9iws for more. 


> NATURE.COM 
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Mine, all mine: the rush to claim minerals and oil is driving China’s marine ambitions. 


Angry words over 
East Asian seas 


Chinese territorial claims propel science into choppy waters. 


BY DAVID CYRANOSKI 


lashes at sea. Disputed borders. It is not 
( the usual stuff of science. But research- 

ers and scientific journals are being 
pulled into long-simmering border disputes 
between China and its neighbours. Confron- 
tations involving research vessels are raising 
tensions in the region, while the Chinese gov- 
ernment is being accused of using its scientists’ 
publications to promote the country’s territo- 
rial claims. 

China’s desire to increase its exploitation of 
the sea is no secret. The country’s 12th five- 
year plan, which covers 2011-15 and was 
approved in March, was the first to mention the 


importance of a marine economy. In May, Chi- 
nas Ocean Development Report estimated that 
marine industries, including offshore oil and 
gas exploration, fisheries and ship building, 
will earn 5.3 trillion renminbi (US$830 billion) 
by 2020. Last month, Zhang Jixian, head of 
the Chinese Academy of Surveying and Map- 
ping, announced that the country will ramp up 
efforts to chart what he described as its “three 
million square kilometres of water territory’, 
an area much larger than that considered by 
neighbouring states to be Chinese territory. 
The mapping project 

will be aided by Chinas NATURE.COM 
first cartographic sat- Nature China: 
ellite, to be launched — www.nafure.com/nchina 


in December, and the Jiaolong submersible, 
which is scheduled to take humans to ocean 
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researchers are trying to make their coun- 
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a 2010 review of the impacts of climate change 
on water resources and agriculture in China, 
published in Nature’, included a map with 
an inserted area that implied that most of the 
South China Sea was part of China. 

Last month, in an online posting that was 
also sent to Nature and other journals, 57 Viet- 
namese scientists, engineers and other profes- 
sionals living around the world complained 
about the use of such maps. The letter laments 
the Chinese government's use of “‘back door’ 
tactics’, and argues that it is “using your maga- 
zine/journal as a means to legitimize such > 
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> [a] one-sided and biased map”. 
A map that appeared in a review of 
Chinese demography published in 
Science’ provoked similar criticism. 
Science responded with an Editor's 
Note’ stating that the journal “does 
not have a position with regard to 
jurisdictional claims” but that it is 


“reviewing our map acceptance pro- 
cedures to ensure that in the future 
Science does not appear to endorse or 
take a position on territorial/jurisdic- 
tional disputes”. 

Meanwhile, Michael Oppenheimer, 
a geoscientist at Princeton Univer- 
sity, New Jersey, who is co-editor of 
Climatic Change, has received a 
barrage of e-mails since June from sci- 
entists contesting a Chinese map that his jour- 
nal published more than four years ago’. The 
map includes a thick ‘cow-tongue shaped dot- 
ted line that claims for China a wide swathe of 
the South China Sea, reaching down towards 
Malaysian Borneo. The scientists, from Viet- 
nam, Finland, Canada and elsewhere, are 
demanding a correction to the map. But this 
kind of highly politicized debate over territory 
“js not a question that a journal like ours wants 
to deal with’, says Oppenheimer. 

Other Vietnamese scientists contacted by 
Nature were most angered by instances of 
what they consider to be gratuitous uses of the 
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cow-tongue map. “They include the line 
around the South China Sea even when 
this region, and the islands within it, have 
absolutely zero relevance to the topic,” says 
Q. Tuan Pham, a chemical engineer at the 
University of New South Wales in Sydney, 
Australia. 

Why Chinese scientists include the con- 
troversial map in their papers is not clear. 
Following the e-mails, Oppenheimer decided 
that the disputed map had no relevance to the 
conclusion of the paper in question, and sug- 
gested that the lead author, Xuemei Shao of the 
Institute of Geographic Sciences and Natural 


Resources Research in Beijing, change 
it. Shao refused, explaining in an 
e-mail that the figure “is requested by 
the Chinese government”. 

Jingyun Fang, a climate-change 
specialist at Peking University in 
Beijing who was a co-author on the 
Nature review, says that he included 
the insert because “we should follow 
China’s law to include these Chi- 
nese seas in the map”. Neither Fang, 
Shao nor any of four authors of other 
articles that included similar maps 
responded to requests from Nature 
for details of these regulations. 

Science, Nature and Climatic 
Change have ultimately decided 
not to remove the offending maps. 
But Tuan Nguyen, a professor of medicine at 
the Garvan Institute of Medical Research in 
Sydney, who has independently complained 
to journal editors about China’s maps of the 
South China Sea, says that maps in journals 
should be treated as scientific data and veri- 
fied before publication. “The publication of 
such a map represents an abuse of science,” 
he says. m SEE EDITORIAL P.285 
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US row threatens Chinese links 


Dispute intensifies over a ban on some types of scientific cooperation with China. 


BY EUGENIE SAMUEL REICH 


hen US presidential science adviser 
John Holdren hosted a dinner and 
meetings between US and Chinese 


science officials in May, he must have known 
it would lead to a high-level stand-off. That 
came to pass on 11 October, when the Govern- 
ment Accountability Office (GAO), an arm of 
Congress, concluded in a report that those 
activities violated legislation banning scientific 
cooperation with China by NASA and by the 
White House Office of Science and Technol- 
ogy Policy (OSTP), which Holdren directs. 
Frank Wolf (Republican, Virginia), the 
congressman who chairs the subcommittee 
that funds science agencies including the OSTP 
and NASA, inserted the ban into a spending bill 
that was passed last spring. Now, backed by the 
GAO report, he has asked the US Department 
of Justice to rein in Holdren’s China-related 
activities; if the department refuses to do so, 


the matter could end up in the courts. 
Holdren — armed with a memo from the 
justice department saying that he has the right 
to conduct diplomacy on behalf of US President 
Barack Obama, even without congressional 
approval — is showing no signs of backing 
down. Yet science- 


“This has policy experts say 
potential to cut that the dispute has 
off collaboration the potential to cast 
with a country a cloud over joint 
onarapidly academic and com- 
rising science mercial research 


efforts between the 
two economic super- 
powers. “This has 
potential to cut off collaboration with a coun- 
try on a rapidly rising science and technology 
trajectory,’ says Richard P. Suttmeier, a retired 
expert on Chinese science policy based in 
Keene Valley, New York. 

Relations between the United States and 


trajectory.” 
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China have their roots in a historic 1972 visit to 
Beijing by US president Richard Nixon. That led 
to a 1979 agreement between the two govern- 
ments for cooperation on scientific activities. 
Suttmeier estimates that US agencies now have 
more than 30 agreements on scientific coop- 
eration with their equivalents in the Chinese 
government. The US National Science Founda- 
tion (NSF) opened an office in Beijing in 2006, 
and the US Department of Energy founded a 
US$150-million Clean Energy Research Center 
with China in 2009. Chinese researchers are 
now more likely to collaborate and co-author 
papers with scientists from the United States 
than with those from any other country. 

“T dont understand the motivation for trying 
to cut off something of benefit to both sides, 
says Martin Briggs, a hydrogeology graduate 
student at Syracuse University in New York, 
who spent two months at Fudan University in 
Shanghai on a fellowship funded by the NSF to 
learn about water quality. 
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China have their roots in a historic 1972 visit to 
Beijing by US president Richard Nixon. That led 
to a 1979 agreement between the two govern- 
ments for cooperation on scientific activities. 
Suttmeier estimates that US agencies now have 
more than 30 agreements on scientific coop- 
eration with their equivalents in the Chinese 
government. The US National Science Founda- 
tion (NSF) opened an office in Beijing in 2006, 
and the US Department of Energy founded a 
US$150-million Clean Energy Research Center 
with China in 2009. Chinese researchers are 
now more likely to collaborate and co-author 
papers with scientists from the United States 
than with those from any other country. 

“T dont understand the motivation for trying 
to cut off something of benefit to both sides, 
says Martin Briggs, a hydrogeology graduate 
student at Syracuse University in New York, 
who spent two months at Fudan University in 
Shanghai on a fellowship funded by the NSF to 
learn about water quality. 


Wolf’s spokesman Daniel Scandling says that 
the congressman sought to ban NASA and the 
OSTP from fostering relations with China after 
NASA administrator Charles Bolden visited the 
country last year and invited Chinese officials 
to visit NASA facilities in return. “Congress- 
man Wolf is deeply concerned by China's 
spying and theft of technology and doesn't 
think it is wise to give the Chinese access to 
advanced space technologies,’ Scandling says. 

Few existing NASA activities will be 


curtailed by the ban. But because much of 
US science relies on the OSTP for high-level 
coordination, the ban could derail activities 
by more agencies than Wolf intends to tar- 
get. He has publicly questioned the wisdom 
of NSF activities involving China, but Scan- 
dling says that Wolf has no immediate plans 
to block cooperation by other agencies. Wolf’s 
subcommittee has already approved a bill that 
would extend the ban into the 2012 fiscal year; 
it is now awaiting passage through the House. 
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But Mu Rongping, a science-policy analyst 
at the Chinese Academy of Sciences in Bei- 
jing, doesn't expect a major withdrawal from 
China-US activities by the United States. It is 
clear that cooperation on funding of projects 
and mobility of researchers is win-win for 
both sides, he explains. 

“US bilateral activities in the fields of 
science, technology and innovation policy are 
important and necessary for both countries, 
not just for China,” he says. m 
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No quick fix for 
Haiti cholera 


As outbreak enters its second year, relief bodies move to 
roll out vaccine for the first time. 


BY DECLAN BUTLER 


year after cholera broke out in the 
Aten of the January 2010 Haiti 
earthquake, the epidemic has disap- 
peared from the headlines, but it continues 
to wreak a deadly toll. Mortality rates remain 
high in some areas, but donor funding for 
front-line response teams is drying up, even 
as a newly approved vaccine offers a glimmer 
of hope. 
The epidemic has already caused almost half 


a million cases, half of which have required 
hospitalization. Haiti’s health ministry says 
that roughly 6,500 people have died from their 
infections — a figure widely believed to be an 
underestimate. 

The Pan American Health Organization 
(PAHO) in Washington DC expects the 
epidemic to ease partially, but nonetheless 
predicts a further 250,000 cases next year, says 
Peter Graaff, the PAHO/World Health Organi- 
zation (WHO) representative in Haiti. 

Shanchol, a vaccine approved by the WHO 


for worldwide use on 29 September, could 
brighten the outlook. It was developed by the 
International Vaccine Institute in Seoul with 
support from the Bill & Melinda Gates Foun- 
dation, and is produced by Shantha Biotechnics 
in Hyderabad, India, a subsidiary of vaccine 
company Sanofi Pasteur. Two groups of non- 
governmental organizations (NGOs) operat- 
ing in Haiti now intend to include the vaccine 
in their own emergency responses to cholera. 

At US$1.85 a dose, Shanchol is cheaper 
than the only other currently available chol- 
era vaccine — Dukoral, made by Crucell, 
based in Leiden, the Netherlands. Shanchol is 
also given orally, and requires fewer doses in 
children under six, who are among the most 
vulnerable to cholera. The vaccine’s benefits 
“are really clear for the outbreak contexts in 
which MSF works’, says Julia Hill, vaccines 
policy adviser at Médecins Sans Frontiéres 
(MSBP), also called Doctors Without Borders, 
in Geneva, Switzerland. 

MSF has one of the largest ongoing emer- 
gency-response operations in Haiti, treating as 
many as one-third of all reported cholera cases. 
It intends to begin vaccinating with Shanchol 
soon in Haiti; it is also considering a vaccina- 
tion campaign in West and Central Africa, 
where cholera is currently resurging. MSF has 
yet to finalize details of its campaign, but will 
probably focus vaccination efforts on vulner- 
able groups, such as people in remote rural 
areas, who lack timely access to treatments. 

Because mountainous communities can 
often be reached only after half-a-day’s jour- 
ney on foot, “a whole village can be infected 
before help arrives’, says Michel Van Herp, an 
epidemiologist in the MSF office in Brussels. 
Van Herp oversaw MSF’s field response at the 
start of the epidemic, when as many as 9% of 
cases resulted in fatalities. In the capital, Port- 
au-Prince, that figure has now been brought 
down below 1%, but in remote rural areas, 
death rates as high as 5-10% persist, he says. 

A separate campaign aims to vaccinate 
100,000 vulnerable Haitians using 200,000 
doses of Shanchol. That effort will be carried 
out jointly by GHESKIO, 
an NGO based in Port- 
au-Prince, and Partners 
in Health (PIH), a non- 
profit organization in > 
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> Boston, Massachusetts, that is affiliated 
with health groups at Harvard University 
in Cambridge, Massachusetts. The group 
intends to launch its campaign in January 
2012, says Louise Ivers, a senior adviser to 
PIH working in Haiti. The lack of clean 
water and sanitation in Haiti, and the poor 
prospects for improving that infrastructure 
in the near future, makes for a “compelling 
argument” to use cholera vaccines, she says. 

The WHO opposed vaccination during 
the chaos of the initial cholera outbreak in 
Haiti; Claire-Lise Chaignat, coordinator of 
the WHO’s Global Task Force on Cholera 
Control, argued at the time that the logistics 
would have been impossible to manage (see 
Nature 468, 483-484; 2010). But the chaos 
has subsided, and the peaks in numbers of 
cholera cases are coming less frequently, so 
the WHO now supports targeted vaccina- 
tion in Haiti, says Chaignat. 

Eventually, the Haitian government will 
have to assume responsibility and run its 
own vaccination programmes, but per- 
suading them to do so could be tough, 
warns Van Herp — not least because there 
is little economic or political interest in 
the rural target communities that might 
benefit most. Van Herp and other experts 
also warn that cholera vaccination is no 
panacea. The focus of relief efforts must 
be on rehydrating the dangerously ill with 
a cheap and highly effective oral solution 
of glucose and salts, he says. Building 
infrastructure to provide clean water and 
sanitation is the only way to prevent cholera 
in the long term. 

But a reduction in donor support for 
NGO frontline responders is hindering 
these efforts, says Pascale Zintzen, MSF’s 
deputy head of mission in Haiti. “Differ- 
ent international groups dependent on 
donors are no longer receiving funds or 
having their funding renewed,” she says. 
At the same time, the government's health 
ministry has been slow to launch projects 
with the funds it has already received, she 
complains. Ivers adds, “Over the past few 
months, we have seen a number of cholera 
treatment centres closing down” 

Graaff says that Haiti needs to begin 
integrating routine cholera surveillance and 
treatment into the health services, mov- 
ing away from its reliance on emergency 
response. Ivers argues, however, that the 
Haitian government is years away from 
being able to make that transition, and that 
continued support from NGOs is essential. 

“T think it is a little misleading to 
suggest that it’s OK that there is less sup- 
port, because the government is somehow 
going to take over,” she says. “Does it have 
the money, the supply chain and the human 
resources to provide the services that are 
needed? I don't think the ministry of health 
has any of those things.” = 
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Milan’s San Raffaele Scientific Institute may have to shut down if it can’t be rescued from bankruptcy. 


BIOMEDICAL RESEARCH 


Vatican bids for 
Italian institute 


Debt and death hit San Raffaele research centre. 


BY ALISON ABBOTT 


ne of Italy’s most prestigious bio- 
() medical research centres now faces 
bankruptcy, against a backdrop of 
rumours fed by intrigue among power-bro- 
kers, allegations of fraud and corruption, and 
a violent death. Next week, a court will decide 
whether to leave the Milan-based San Raffaele 
Scientific Institute to its fate, or allow a con- 
sortium led by the Vatican Bank to rescue it. 
The institute, which includes a 1,400-bed 
research hospital, a basic-research centre and 
a small university, has run up debts approach- 
ing €1.5 billion (US$2.1 billion). The debt has 
been attributed to poor investment decisions 
and overzealous expansion, and criminal 
investigations have begun into alleged corrup- 
tion in the institute's financial dealings. Most 
granting agencies have now suspended pay- 
ments to the scientists there, and many com- 
panies are no longer supplying consumables. 
“We are still working, but in low gear,” says 
immunologist Maria Grazia Roncarolo, the 
institute's scientific director. 
Scientists inside and outside Italy are 
appalled. “The San Raffaele is one of the most 
high-quality research centres in Italy, perhaps 
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in Europe,” says Paul Herrling, chair of the 
institute's scientific advisory board and head 
of developing-world medicine at Novartis 
in Basel, Switzerland. “It has one of the best 
set-ups for gene therapy in the world — it 
needs to be saved.” 

The charismatic figure at the centre of the 
story is 91-year-old Luigi Maria Verzé, the 
Catholic priest who founded the hospital in 
1971 and formally remains chair of the San Raf- 
faele del Monte Tabor Foundation, which runs 
the institute. He added on a large building in 
1992 to expand the facilities for basic research. 
In 1996, he founded a university on campus 
with three faculties — medicine, psychology 
and philosophy — in alignment with his belief 
that humans comprise body, mind and soul. 

The San Raffaele Scientific Institute is none- 
theless constitutionally secular, and Verzé has 
never imposed restrictions on research or 
teaching activities. His acknowledged skill in 
appointing productive scientists — frequently 
interviewing them himself — allowed the 
research centre, home to 221 permanent and 

332 contract scientists, to 
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> Boston, Massachusetts, that is affiliated 
with health groups at Harvard University 
in Cambridge, Massachusetts. The group 
intends to launch its campaign in January 
2012, says Louise Ivers, a senior adviser to 
PIH working in Haiti. The lack of clean 
water and sanitation in Haiti, and the poor 
prospects for improving that infrastructure 
in the near future, makes for a “compelling 
argument” to use cholera vaccines, she says. 

The WHO opposed vaccination during 
the chaos of the initial cholera outbreak in 
Haiti; Claire-Lise Chaignat, coordinator of 
the WHO’s Global Task Force on Cholera 
Control, argued at the time that the logistics 
would have been impossible to manage (see 
Nature 468, 483-484; 2010). But the chaos 
has subsided, and the peaks in numbers of 
cholera cases are coming less frequently, so 
the WHO now supports targeted vaccina- 
tion in Haiti, says Chaignat. 

Eventually, the Haitian government will 
have to assume responsibility and run its 
own vaccination programmes, but per- 
suading them to do so could be tough, 
warns Van Herp — not least because there 
is little economic or political interest in 
the rural target communities that might 
benefit most. Van Herp and other experts 
also warn that cholera vaccination is no 
panacea. The focus of relief efforts must 
be on rehydrating the dangerously ill with 
a cheap and highly effective oral solution 
of glucose and salts, he says. Building 
infrastructure to provide clean water and 
sanitation is the only way to prevent cholera 
in the long term. 

But a reduction in donor support for 
NGO frontline responders is hindering 
these efforts, says Pascale Zintzen, MSF’s 
deputy head of mission in Haiti. “Differ- 
ent international groups dependent on 
donors are no longer receiving funds or 
having their funding renewed,” she says. 
At the same time, the government's health 
ministry has been slow to launch projects 
with the funds it has already received, she 
complains. Ivers adds, “Over the past few 
months, we have seen a number of cholera 
treatment centres closing down” 

Graaff says that Haiti needs to begin 
integrating routine cholera surveillance and 
treatment into the health services, mov- 
ing away from its reliance on emergency 
response. Ivers argues, however, that the 
Haitian government is years away from 
being able to make that transition, and that 
continued support from NGOs is essential. 

“T think it is a little misleading to 
suggest that it’s OK that there is less sup- 
port, because the government is somehow 
going to take over,” she says. “Does it have 
the money, the supply chain and the human 
resources to provide the services that are 
needed? I don't think the ministry of health 
has any of those things.” = 
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Milan’s San Raffaele Scientific Institute may have to shut down if it can’t be rescued from bankruptcy. 


BIOMEDICAL RESEARCH 


Vatican bids for 
Italian institute 


Debt and death hit San Raffaele research centre. 


BY ALISON ABBOTT 


ne of Italy’s most prestigious bio- 
() medical research centres now faces 
bankruptcy, against a backdrop of 
rumours fed by intrigue among power-bro- 
kers, allegations of fraud and corruption, and 
a violent death. Next week, a court will decide 
whether to leave the Milan-based San Raffaele 
Scientific Institute to its fate, or allow a con- 
sortium led by the Vatican Bank to rescue it. 
The institute, which includes a 1,400-bed 
research hospital, a basic-research centre and 
a small university, has run up debts approach- 
ing €1.5 billion (US$2.1 billion). The debt has 
been attributed to poor investment decisions 
and overzealous expansion, and criminal 
investigations have begun into alleged corrup- 
tion in the institute's financial dealings. Most 
granting agencies have now suspended pay- 
ments to the scientists there, and many com- 
panies are no longer supplying consumables. 
“We are still working, but in low gear,” says 
immunologist Maria Grazia Roncarolo, the 
institute's scientific director. 
Scientists inside and outside Italy are 
appalled. “The San Raffaele is one of the most 
high-quality research centres in Italy, perhaps 
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in Europe,” says Paul Herrling, chair of the 
institute's scientific advisory board and head 
of developing-world medicine at Novartis 
in Basel, Switzerland. “It has one of the best 
set-ups for gene therapy in the world — it 
needs to be saved.” 

The charismatic figure at the centre of the 
story is 91-year-old Luigi Maria Verzé, the 
Catholic priest who founded the hospital in 
1971 and formally remains chair of the San Raf- 
faele del Monte Tabor Foundation, which runs 
the institute. He added on a large building in 
1992 to expand the facilities for basic research. 
In 1996, he founded a university on campus 
with three faculties — medicine, psychology 
and philosophy — in alignment with his belief 
that humans comprise body, mind and soul. 

The San Raffaele Scientific Institute is none- 
theless constitutionally secular, and Verzé has 
never imposed restrictions on research or 
teaching activities. His acknowledged skill in 
appointing productive scientists — frequently 
interviewing them himself — allowed the 
research centre, home to 221 permanent and 

332 contract scientists, to 
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competitive research grants and €18 million 
in research funding from the Italian minis- 
try of health, and published more than 800 
scientific papers. It also carried out some 
250 clinical protocols, half of which related 
to discoveries made in its own labs in areas 
such as gene and stem-cell therapy. It also has 
several large strategic partnerships with the 
pharmaceutical industry. 

Verzé has exported his concept to 15 other 
countries, including the 500-bed Sao Rafael 
Hospital in Bahia, Brazil. In 2005, he began 
a major expansion programme at the Milan 
campus, constructing three large buildings 
for the research centre and university and for 
start-up companies, although scientists have 
yet to move into the latest of these facilities. 

In February, newspapers first started to 
report on the institute’s debts, then believed 
to be around half a million euros. “At first we 
werent so worried because we thought it would 
sort itself out,’ says Giulio Cossu, director of 
San Raffaele’s Stem Cells Research Institute. 

But at the end of June, the debt was revealed 
to be closer to €1 billion. “We began to realize 
the situation was serious,’ says Cossu. The board 
of the San Raffaele del Monte Tabor Founda- 
tion was replaced and most of the executive 
power was transferred to its new vice-chair, 
Giuseppe Profiti, who is director of the Vatican’s 
Paediatric Hospital of the Baby Jesus in Rome. 


The situation soon worsened. On 18 July, 
Verzé’s right-hand man, Mario Cal, shot 
himself in his San Raffaele office. He died on 
an operating table at the hospital. Since then, 
Italian newspapers have reported that the 
foundation had invested heavily, and perhaps 
unwisely, in real estate, energy, various hotels 
and even a mango plantation. 

The proposed rescue package is led by the 
Vatican Bank and a businessman, Vittorio 
Malacalza, who plan to absorb €500 million 
of debt and invest a further €250 million. The 
reason behind the Vati- 


can’s interest is unclear: “The San 

atodds withthe Vatican Raffaele is at 

nae leaden as - ve sialic 
crossroad.” 


such as euthanasia, which 
he does not regard asa sin. 

The rescue consortium has told scientists 
at the San Raffaele that it plans to keep the 
research and hospital complex intact and 
shed investments unrelated to health. A Milan 
court will decide on 26 October whether the 
proposed investment and restructuring is 
sufficient to avoid bankruptcy. 

The threatened bankruptcy means that 
most payments to the institute have been 
stopped. The frozen funds include millions 
of euros in grants from the European Union's 
Seventh Framework programme; the Italian 
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Association for Cancer Research, a charity 
based in Milan; and the European Research 
Council (ERC). The Telethon Foundation, 
however, a research charity in Rome that has 
approved grants worth €7 million, is now pay- 
ing grantees directly, rather than through the 
institute. 

One of the institute’s seven ERC grant- 
holders has been unable to begin his work; 
another has left; and a third is planning to do 
so, because of the problems, says Roncarolo. 
Late last month, representatives of computer 
manufacturer Hewlett Packard arrived at 
the institute’s genomics centre to take back 
leased data-storage instrumentation until the 
finances are sorted out. 

Although the institute's scientists are eager 
to be rescued, Roncarolo says that they are 
also worried about whether the new owners 
will understand that it is the mixture of basic 
and clinical science under one roof that makes 
the hospital so strong. 

“The San Raffaele is at a dangerous cross- 
road,’ says Aaron Ciechanover, a Nobel laure- 
ate and chemist at Technion, Israel’s Institute 
of Technology in Haifa, and a member of 
the San Raffaele’s scientific advisory board. 
“T hope that the court will approve the rescue 
plan, and that the new owners will realize what 
a jewel they have and understand they need to 
keep the excellent research going.” m 
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LOBBYING IN FLUX 
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After several years of growth, the total spent on advocacy by the US education industry, including research universities, has levelled off (left). 
Figures for selected institutions and scientific societies reveal a more complex picture (centre, right). 
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Lobbyists confront 
US budget crunch 


Money for advocacy is tight as federal cuts loom. 


BY EUGENIE SAMUEL REICH 


he business of lobbying the US govern- 

ment on behalf of research has rarely 

been more challenging, according to 
an informal survey. A polarized Congress 
focused on reducing government spending, 
coupled with anti-science sentiment among 
some lawmakers, has created a chilly atmos- 
phere for those arguing for robust and long- 
term investment in basic research. 

“The environment is toxic. It’s dysfunc- 
tional. It’s like it’s always been, but worse,” 
says Mike Lubell, who oversees government 
relations at the American Physical Society in 
Washington DC. And it is leading lobbyists to 
adopt new tactics. 

Lobbyists have conventionally advocated 
for their own specific programmes and agency 
budgets. But in recent weeks they have been 
aiming their appeals higher, spurred by fears 
of an across-the-board cut to discretionary 
spending — the portion of the federal budget 
that includes science. Such a cut could hap- 
pen if the Joint Select Committee on Deficit 
Reduction, set up by Congress in August to 
identify ways to trim the US federal deficit, 
fails to find a formula that satisfies both par- 
ties. The committee has until 23 November to 
arrive at a plan, but with Republicans opposing 
tax increases and Democrats unwilling to cut 
entitlement programmes such as Medicare and 
Social Security, its efforts could prove fruitless. 
In that case, legislation stipulates that cuts will 
be imposed on all federal agencies for the 2013 
budget (see Nature 476, 133-134; 2011). 


On 21 September, a letter signed by more 
than 130 university presidents was sent to the 
joint select committee asking it to reach a “big 
agreement” so as to spare discretionary spend- 
ing from drastic cuts. This kind of high-level 
lobbying would not be necessary in a normal 
year, says Jennifer Poulakidas, vice-president for 
congressional and governmental affairs at the 
Association of Public and Land-Grant Universi- 
ties, a signatory of the letter. “Tt goes further than 
dealing with our own interests,’ she says. 

Anti-science rhetoric on the political right — 
for example, an attack in May by Senator Tom 
Coburn (Republican, Oklahoma) on research 
grants given by the US National Science Foun- 
dation — has prompted many lobbyists to seek 
new ways to reach lawmakers away from the 
partisan atmosphere of Washington DC. Jen- 
nifer Zeitzer, director of legislative affairs at the 
Federation of American Societies for Experi- 
mental Biology in Washington DC, says that 
lobbyists are making a major effort to persuade 
scientists to approach their representatives in 
their home districts. “It’s the first time we're 
trying to get members of Congress at home,” 
she says, pointing to a 19 September event in 
Birmingham, Alabama, at which National Insti- 
tutes of Health staff and researchers funded by 
the agency stressed the agency's crucial role in 
advancing medical science to congressional staff 
working for Democratic and Republican mem- 
bers of the House of Representatives. 

The assumption is that legislators will feel 
more sympathetic to science once they realize 
that federal research dollars flow back to major 
employers in their districts. “We argue federal 


funding will be conducive to job creation,” says 
Gene Irisari, director of government affairs at 
the Dallas-based company Texas Instruments, 
which is a member of the Task Force on Ameri- 
can Innovation, a coalition of universities and 
companies that advocate for physical-sciences 
research. On 21 September, the group organ- 
ized an event on Capitol Hill on the science 
that went into Apple's iPad computer tablet; the 
room was packed out. Rather than approach- 
ing representatives at home, the idea was to 
draw on the frenetic energy of Capitol Hill 
with a topic that crossed political boundaries. 

While the challenges multiply, money for 
science lobbying is tight. Spending by the US 
education industry, which devoted $100 mil- 
lion to lobbying in 2010, has hit a plateau after 
nearly a decade of growth (see chart). David 
Moore of the Association of American Medical 
Colleges, which has spent nearly $1.2 million 
on lobbying so far this year, says that although 
the association itself remains flush he has 
heard that other advocacy coalitions are scal- 
ing back because their members cannot afford 
to keep up membership fees. 

That plateau is underscored by declines in 
lobbying expenditure for some key institutions. 
For example, following a peak in 2007, spend- 
ing at Harvard University in Cambridge, Massa- 
chusetts, is falling, as is that at the University of 
California system, which peaked in 2008. Data 
for scientific societies tend to oscillate as their 
approaches change from year to year. Michael 
McPhaden, president of the American Geo- 
physical Union, says the society made a con- 
scious decision in 2010 to become more actively 
engaged after several years of little activity. 

All are haunted by the possibility that the 
lean times for science are more than a tempo- 
rary blip resulting from the bad economy. They 
fear that the steady budgetary growth that 
the entire US scientific enterprise has come 
to rely on is in jeopardy, says David Korn, a 
pathologist at Harvard University who has fol- 
lowed science budgets as an administrator and 
research-funding advocate for four decades. If 
it is, says Korn, “how do you downsize with 
minimal damage? That's the question all of us 
are worried about.” m SEE EDITORIAL P.285 
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DEMOGRAPHICS 


Seven billion and counting 


A look behind this month’s global population landmark reveals a world in transition. 


BY JEFF TOLLEFSON 


hat’s in anumber? This month, the 

world’s attention turns to a big one: 

7 billion, the latest milestone in 
humanity’s remarkable and worrying rise in 
population. According to a calculation based 
on surveys, censuses, probabilistic analyses, 
human sleuthing, expert opinion, heated 
academic debates and not a little guesswork, 
31 October 2011 is the most likely day that the 
number of living humans will top that figure. 
The date is a rough marker. Assuming a 1% 
margin for error, the milestone might have 
arrived earlier this year, or may not be reached 
until 2012. 

“It’s a relatively simple accounting frame- 
work: only birth, death and migration can 
change a population,” says Thomas Buettner, 
assistant director of the Population Division of 
the United Nations Department of Economic 
and Social Affairs in New York City, where 
the calculation was carried out ona standard 
desktop PC. “But there are a lot of conflicting 
data as well” 

Errors and patchy data contribute to the 
uncertainty, but demographers must also deal 
with institutional bias (schools the world over 
have an interest in boosting their enrolment 
numbers), as well as political motivations. 
China’s one-child policy, for instance, is the 
source of endless confusion: estimates of the 
actual fertility rate vary from 1.3 to 1.8 children 
per female, Buettner says, with vast conse- 
quences for projections of its future population. 

On one level, a figure of 7 billion is incred- 
ible for the sheer momentum it represents: a 
full doubling of the planet’s population since 
1967, with current growth adding 200,000 
people each day, and a nation larger than the 
size of France each year. But although the 
6-billion mark was reached about 13 years 
ago according to revised figures, it will take 
nearly 14 years to hit 8 billion (see ‘Snapshots 
of growth). The comparison shows that pop- 
ulation growth is decelerating; it is likely to 
level off at about 10 billion before the end of 
the century. 

Between now and then, the fastest growth 
will be in Africa, where fertility levels remain 
higher than anywhere else in the world. Popu- 
lation levels among industrialized countries, 
by contrast, will remain relatively constant. 
Although Asia will remain the most populous 
continent, decreasing fertility rates there will 
add to the overall ‘greying’ of the planet. 

Because high fertility is linked with poverty, 


the big worry globally is the 3.4 billion people 
who survive on less than US$2 per day, says 
Joel Cohen, who heads the Laboratory of Pop- 
ulations at the Rockefeller University in New 
York. The average number of children per 
woman in the least developed countries is 4.5, 
compared with 1.7 for developed countries, 
Cohen says, and this means that most of the 
additions to the global count are born where 
there is little access to energy and education, 


SNAPSHOTS OF GROWTH 


further fuelling the population steamroller. 
This reality underscores the importance of 
international development efforts: small 
increases in wealth and education can lower 
fertility and dramatically ease the burden of 
population growth decades down the road. 

As seen through the matter-of-fact perspec- 
tive of the statistician: “If you are successful 
now, says Buettner, “the benefits accumulate 
over time.” m 


By the time the next billion people are added to the world’s population it will not only be a more crowded 
planet but a perceptibly different one in terms of geography, age and modes of living. Yet even a small 
divergence from projected fertility rates could spell very different outcomes. 


Worldwide population 


@ Population will 
me probably reach [_—_————— 
8 billion by 2025 


14°" — 0.5 children per female more than projected 
5 12... ——= Projected most likely fertility scenario 
® —— 0.5 children per female less than projected 
a5 
o= 8 Bh 
22 
a soos 
2 
= In 1998 population 

2. reached 6 billion 

0) 

1 


Oceania 


Estimated and projected 
population by region (billions) 
Ww 


URBANIZATION 


Between the last ‘billion’ milestone and 
the next, the number of city dwellers 
will grow by roughly 1.8 billion. 
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AGEING 


Despite pockets of youth in developing countries where 
populations are booming, the world is generally getting 
older as people live longer. 
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Spies to use Twitter as crystal ball 


US intelligence agency aims to forecast unrest by reading the runes of social media. 


BY SHARON WEINBERGER 


that can predict future events such as riots, 

political upheavals and the outbreak of 
wars. Last week, a collection of academics and 
private businesses was scrambling to meet the 
deadline for proposals for research aiming to 
do just that. 

The Intelligence Advanced Research Pro- 
jects Activity (IARPA), a research arm of the 
US intelligence community, is sponsoring the 
work under the Open Source Indicators (OSI) 
programme. The three-year project, with an 
unspecified budget, is designed to gather digi- 
tal data from a range of sources, from traffic 
webcams to television to Twitter. The goal, 
according to IARPA, is to provide the intel- 
ligence community with predictions of social 
and political events that can “beat the news”. 

Initially, the OSI project will focus on Latin 
America, which has abundant publicly avail- 
able data and offers a convenient test bed for 
researchers’ models. Those models will build 
on strategies that have already shown prom- 
ise for predicting disease outbreaks and con- 
sumer behaviour, and which are becoming 
increasingly popular with US national security 
agencies (see Nature 471, 566-568; 2011). 

Indeed, the OSI project is one of many 
being sponsored by the US national security 
community, which seeks to meld mathemat- 
ics, computer science and economics with 
the social sciences, creating a new field of 
social and political forecasting that has often 
been compared to Isaac Asimov's concept of 
‘psychohistory. 

At the Center for Collective Intelligence at 
the Massachusetts Institute of Technology in 
Cambridge, computer scientist Peter Gloor has 
been working with colleagues to build models 
that can predict consumer behaviour, such as 
ticket sales for Hollywood films, using a range 
of online sources including social media. 
“We're up to 90% accuracy” for predicting 


> 
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box-office returns, says Gloor, who is part of a 
team applying for OSI funding. 

John Brownstein, an epidemiologist at 
Harvard Medical School in Boston, Massa- 
chusetts, is working with a group that analyses 
international news sources, government data 
and social media to provide an early warning 
of disease outbreaks. He is also applying to 
work on the IARPA project. “In many cases, 
what we are searching for are patterns of activ- 
ity that would not only apply to disease events 
but to conflicts, environmental disasters and 
other forms of social dis- 


ruption,” he says. “Just because 
However, there is data are 
disagreement about how available 
effective such biosurveil- doesn’t make 
lance programmes have them good, or 
been. Picking up hintsof thekey to the 


an HIN1 outbreak is less 
useful if it is swamped 
by false alarms, says James Wilson, a medi- 
cal doctor and the co-founder of Ascel Bio in 
New York. “There is a huge difference between 
a computer harvesting data and beginning to 
move communications so that people begin to 
contemplate action,’ says Wilson, whose com- 
pany is applying for OSI funding. 


kingdom.” 


DATA CRUNCHING 
The approach is nevertheless catching on in 
the financial sector. Christopher Ahlberg, the 
chief executive of Recorded Future in Bos- 
ton, also applying to the OSI project, says his 
company has proprietary software that can 
crunch through 300,000 sources an hour for 
clues to future stock movements. The com- 
pany has already received investment from 
In-Q-Tel, a venture-capital firm in Arlington, 
Virginia, founded by the US Central Intel- 
ligence Agency, and is mining online data 
sources, such as Twitter, in an effort to predict 
cyber attacks and developments in the ongoing 
‘Occupy Wall Street’ protests. 

There is already plenty of published 


literature on predicting stock market volatility 
based on open sources, says Kalev Leetaru, a 
computer scientist at the University of Illinois 
at Urbana-Champaign. But that’s a long way 
from being able to predict a riot next Wednes- 
day. Leetaru, who is also part of a team apply- 
ing to be part of the OSI project, says he asked 
IARPA whether it would consider a broader 
approach that involved predicting trends, such 
as the Arab Spring movement, rather than 
focusing on specific events (see Nature http:// 
dx.doi.org/10.1038/news.2011.532; 2011). The 
answer, he says, was “no”. 

“In my mind,” Leetaru says, “this would 
be akin to the National Institutes of Health 
announcing it was terminating all funding 
for work on cancer drugs that targeted indi- 
vidual cancers or that slowed cancer down, and 
instead announcing that they would only fund 
a single miracle pill that you take and your can- 
cer is cured the next morning” 

Robert Albro, an anthropologist at American 
University in Washington DC, and an expert 
in Latin American social movements, believes 
that IARPA is mistakenly presuming that social 
media will provide high-quality data. “Just 
because data are available doesn’t make them 
good, or the key to the kingdom,’ he says. 

A bigger question is whether models used 
to measure consumer preferences and disease 
outbreaks are applicable to the complex world 
of social change and political events. Albro 
says that such models make faulty assumptions 
about what motivates humans, and he worries 
“that companies concerned about consumer 
behaviour are now driving how IARPA thinks. 
That’s a leap of faith.” m 


CORRECTION 

The ‘Beyond the bomb’ issue of Nature 

(22 September 2011) inadvertently omitted 
the credit for the cover illustration. The 
image was produced by Viktor Koen. 
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MUSEUM OF LONDON ARCHAEOLOGY 


String theory finds 
a bench mate 


BY ZEEYA MERALI 


The exotic theory of everything could shed light 
on the behaviour of real materials, thanks to 


r4 ¢ O none side,” says Jan Zaanen, “you have this refined, almost 
other-worldly intellectual — the perfectionist obsessed with 
detail, barely interested in earthly pleasures. On the other, 
you have the loud, boisterous, sometimes aggressive, business-savvy 
character who knows how to get his hands dirty.” 

It might almost be a description of the misfit roommates known on 
stage, screen and television as The Odd Couple. But Zaanen, a condensed- 
matter physicist at the University of Leiden in the Netherlands, is actually 
describing the pairing of two groups of scientists: string theorists, who 
spend their days pursuing a rarefied, highly mathematical ‘theory of eve- 
rything; and his own colleagues — a considerably more grounded bunch 
who prefer to focus on how real-world materials behave in the laboratory. 

The scientists trying to bridge these disciplines are motivated by the 
discovery of a startling coincidence: suitably interpreted, the equa- 
tions of string theory can be a powerful tool for analysing some exotic 
states of matter, ranging from super-hot balls of quarks and gluons to 
ultracold atoms. The past year alone has seen at least four international 


302 | NATURE | VOL 478 | 20 OCTOBER 2011 


an unexpected mathematical connection 
with condensed-matter physics. 


workshops designed to stimulate collaborations across the disciplinary 
divide, including one hosted by Zaanen in Leiden. 

Sceptics still question whether this strange alliance will actually lead 
to new insights, or whether it is just a marriage of convenience. String 
theory does hint at the existence of many new states of matter, for exam- 
ple. But those predictions will be difficult to verify, and decisive experi- 
mental tests are only now in the planning stages. 

For the time being, the advantage to both partners is clear. String 
theory, long criticized for having lost touch with reality, gets experi- 
mental credibility. And condensed-matter physics, never the media dar- 
ling that string theory has been, gets a new mathematical tool — and a 
chance to baskin new-found glamour. 

The match-making began a dozen years ago with the reunion of Dam 
Thanh Son and Andrei Starinets, who had been undergraduates and 
dorm-mates at Moscow State University in the 1980s. The friends had 
lost touch with each other when they left Russia after the fall of com- 
munism in 1991. But in 1999, Son got a job at Columbia University 
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in New York City, and heard that Starinets was doing a PhD in string 
theory just a few kilometres away at New York University. So Son went 
to pay Starinets a visit. 

Collaboration was the farthest thing from his mind. String theory is 
mathematically rich and has an undeniable aesthetic appeal. But it is 
all about what physics might be like at scales of 10~*° metres — the idea 
being that seemingly point-like elementary particles such as quarks and 
electrons will actually turn out to be tiny, vibrating threads of energy 
when viewed at such scales. But these strings would be about 20 orders 
of magnitude smaller than a proton, putting the theory hopelessly 
beyond the reach of any direct experimental test. Son's speciality, by 
contrast, was firmly rooted in experiment: he was trying to understand 
the properties of quark-gluon plasmas, the short-lived, super-hot fire- 
balls that form when heavy nuclei such as gold are smashed together in 
accelerators. All this stringy stuff seemed utterly alien. 

Except that, when Son saw the string-theory calculations that Stari- 
nets had been working on with fellow PhD student Giuseppe Policastro, 
he recognized the equations as the same ones he had been using to 
analyse the plasma. 

Son immediately had to know what was going on, and Starinets 
began to explain. Starinets and Policastro had been working on an 
idea proposed in 1997 by Juan Maldacena, a physicist at Harvard Uni- 
versity in Cambridge, Massachusetts. Maldacena, now at the Institute 
for Advanced Study in Princeton, New Jersey, had realized that string 
theory predicts a mathematical equivalence between two hypothetical 
universes, one of which would be similar to our own. It would have the 
same three dimensions of space and one dimension of time, for example, 
and be filled with much the same types of elementary particle, which 
would, in turn, obey familiar-looking (to physicists) quantum-field 
equations. But it would not contain strings — or gravity. 

The other universe would be the opposite: it would contain both strings 
and gravity — indeed, the gravity could get strong enough to form black 
holes — but no elementary particles. It would also have an additional 
dimension of space. 

Maldacena’s insight was simple, if audacious: take any process involv- 
ing particles and fields in the first universe, he said, and it could equally 
well be described as a process involving gravity, black holes and strings 
in the second universe — and vice versa’. The equations might look 
very different. But the fundamental physics would be exactly the same. 

That was why Son was seeing quark—gluon equations in a string- 
theory calculation, Starinets explained: they were the three-dimensional 
equivalent of the gravitational fields that he and Policastro had been 
studying in the four-dimensional universe. 


Marriage of convenience 

All this jumping back and forth between universes was weird even by 
string-theory standards (and even weirder for non-string theorists, as 
Maldacena had showed that the mapping worked not just between three 
and four dimensions of space, but also between four and five, five and 
six and so on). But as Son and Starinets talked, they began to see that 
Maldacena’s mapping might be a powerful problem-solving strategy. 
They could start with a messy set of quantum-field calculations in our 
real, three-dimensional world — the quark-gluon plasma equations, 
say — then map those into the four-dimensional world, in which the 
equations tend to be much easier to solve. Then they could map the 
results back to the three-dimensional world and read off the answer. 

It worked. “We turned the calculation on its head to give us a predic- 
tion for the value of the shear viscosity of a plasma,’ says Son, referring 
to a key parameter of the quark-gluon fireball’. “A friend of mine in 
nuclear physics joked that ours was the first useful paper to come out 
of string theory,” he says. 

In 2008, the team’s predictions were confirmed’ at the 
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says Steve Gubser, a string theorist at Princeton University, and one of 
the early champions of applying the principle to real-world problems. 

The team’s success also caught the attention of Subir Sachdev, 
a condensed-matter theorist at Harvard. Just as Son had seen a plasma 
reflected back at him in Starinets’ equations, Sachdev saw quantum 
critical-phase transitions — the changes of state that occur in materi- 
als when they near absolute zero, when quantum-mechanical effects 
begin to dominate. “They were using different words,” he says, “but it 
was the same physics.” 

Sachdev hoped that Maldacena’s idea could provide him and his fellow 
theorists with some much-needed help in exploring this chilly realm. 
Over the decades, experimentalists had discovered a long list of exotic, 
quantum-dominated states — including superconductors that allow cur- 
rent to flow without resistance; superfluids that have no viscosity and can 
creep up the walls of beakers; Bose-Einstein condensates made up of 
atoms moving in step like a single ‘super atom; and ‘strange’ metals that 
behave in ways subtly different from ordinary metals. But physicists still 
have no way to predict what will turn up in the lab next. “We can't even 
answer the fundamental question of how many phases of matter exist,” 
says Sean Hartnoll, a string theorist at Stanford University in California. 

Sachdev’s first efforts to apply Maldacena’s idea to laboratory materi- 
als had resulted in two papers he co-authored in 2007, one with Son and 
his colleagues’, and another with a team that included Hartnoll’. Since 
then, Sachdev and his collaborators have built up a recipe for mapping 
the conductivity of strange metals into the properties of black holes in 
the string theorists’ four-dimensional universe — a strategy that string 
theorist John McGreevy at the Massachusetts Institute of Technology 
in Cambridge® and others are also pursuing. These groups get answers 
that broadly reproduce the peculiar low-temperature behaviour of the 
metals. They have also mapped the behaviour of four-dimensional black 
holes in string theory to the conditions at which many materials will 
change phase into states other than the familiar solid, liquid and gas’. 
“We now have a whole new hammer for attacking the problems I have 
been working on for 20 years,’ says Sachdev (see ‘An unexpected link’). 

Sachdev’s involvement, in turn, has helped to ignite the interest of 
other condensed-matter physicists. “A lot of us got into this field because 
of the force of Subir’s personality and his reputation — we realized that if 
he was taking this seriously, maybe we should too,’ says Andrew Green, 
a condensed-matter physicist at the University of St Andrews, UK, who 
co-organized a workshop on the correspondence at Imperial College 
London in January. 

The condensed-matter results also got the string theorists excited — 
eventually. The field had been generally unenthusiastic about following 
up on the quark-gluon plasma calculations, says Clifford Johnson, a 
string theorist at the University of Southern California in Los Angeles. 
And at least part of the reason, he suspects, was a bias against sullying 
string theory’s purity. “There was a snobbery among some towards what 
was termed ‘mere applications’? he says. 

But in 2006, string theory took a public battering in two popular 
books: Not Even Wrong by Peter Woit, a mathematician at Columbia, 
and The Trouble With Physics by Lee Smolin, a physicist at the Perim- 
eter Institute for Theoretical Physics in Waterloo, Canada. Both books 
excoriated the theory’s isolation from experiment. 

“Tt’s hard to say whether the interest in condensed-matter applications 
is a direct response to those books because that’s really a psychological 
question,” says Joseph Polchinski, a string theorist at the Kavli Institute 
for Theoretical Physics in Santa Barbara. “But certainly string theorists 
started to long for some connection to reality.” 

The condensed-matter partnership seemed perfect for that. If nothing 
else, it promised to make a virtue out of string theory’s embarrassment of 
riches — the roughly 10°” solutions to its basic equations, each of which 

describes a possible universe with its own size, shape, dimen- 


Relativistic Heavy Ion Collider at Brookhaven National Lab- NATURE.COM sionality and physical laws. Through Maldacena’s idea, says 
oratory in Upton, New York. “These were strong quantitative Hearmoreaboutthis string theorist Jerome Gauntlett at Imperial College London, 
results, and they still stand today as the best results achieved _storyinourpodcast: “each solution can be expressed in the countless materials yet 
by the programme to relate string theory to experiment,” _go.nature.com/jnpege to be discovered”. 
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An unexpected link 


: 


String theory can help condensed-matter 
physicists to predict the conditions under 
which materials will change phase, 
among other phenomena. 


String theory incorporates additional 
dimensions of space, and contains 
gravity, strings and black holes. 
A black hole emits 
strings, much as a lump 
of hot coal radiates heat. 


On the three-dimensional 
(3D) ‘surface’ of a four- 
dimensional (4D) space, the 
behaviour of the strings 
corresponds to the behaviour 
of subatomic particles. 


The rewards are mutual, says Zaanen. “If] talk about superconductors 
and black holes in a colloquium, folk are attracted to it like bees to 
honey,’ he says. “It’s now bringing young blood to condensed-matter 
physics, as their first choice.” 

The flurry of workshops promoting the partnership have been very 
productive, agrees Polchinski. He co-organized a meeting at the Kavli 
Institute last year that sparked seven new collaborations, and he is run- 
ning another from August through November of this year. “It is unique 
to try to actively make two groups of physicists collaborate so quickly 
— haven't personally seen any other similar drive over the course of 
my career,’ says Gauntlett, who co-organized the London workshop. 

Just as with the fictional odd couple, however, this partnership still 
has plenty of friction. Everyone agrees, for example, that condensed- 
matter physicists are much more hesitant about pairing up than their 
string-theory counterparts. “I have been remarkably unsuccessful at 
getting condensed-matter physicists to let string theorists speak at their 
big meetings,” says Zaanen. “They fear that they will need to learn string 
theory to talk to them. It’s as though I am asking them to have coffee 
with aliens” 

Polchinski admits that the condensed-matter sceptics have a point. 
“T don't think that string theorists have yet come up with anything that 
condensed-matter theorists don't already know,’ he says. The quantita- 
tive results tend to be re-derivations of answers that condensed-matter 
theorists had already calculated using more mundane methods. 

To make matters worse, some of the testable predictions from string 
theory look a tad bizarre from the condensed-matter viewpoint. For 
example, the calculations suggest that when some crystalline materials 
are cooled towards absolute zero, they will end up in one of many lowest- 
energy ground states. But that violates the third law of thermodynamics, 
which insists that these materials should have just one ground state. 
“That's the gorilla in the room that should be keeping people awake at 
night,” says Gubser. 

To win over sceptics, theorists are busily searching for testable 
predictions that will lead to killer evidence that the collaborations are 
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String theory predicts the 
behaviour of black holes, 
represented here, and matter at 
ultra-small scales. 


Condensed-matter 
experiments look at the 
behaviour of ordinary matter, 
which can behave in unusual 
ways in extreme conditions. 


When a black hole is cooled to a 
sufficiently low temperature, a ‘halo’ 
of charged matter will suddenly form. 


The 3D surface for this 
event corresponds to 
electrons changing 
phase and starting to 
flow without resistance: 
the material becomes a 
superconductor. 
Physicists can look at 4D 
processes to hunt for 
other 3D transitions. 


worthwhile. Gauntlett’s group, and others, are hunting for black-hole 
configurations in the string-theory universe that map to undiscovered 
phase transitions*. The trick is to figure out which materials might 
exhibit those transitions. “Right now that involves going round ask- 
ing, ‘have you seen something like this?’,” says Gauntlett. “But the hope 
is that as techniques advance, experimenters will be able to engineer 
materials with the properties we predict?” 

Sachdev is applying string theory to an existing challenge: calculating 
how conductance should change with temperature as ultracold atoms 
transition from a superfluid state to an insulating one’. He thinks that 
it should be possible to test his predictions in the next couple of years. 

Even if the programme is successful, there are limits to how much 
the relationship can benefit either partner. String theory can offer a 
handbook of properties to look for, and predictions for how they should 
change in experiments, says Green. But it will never be able to provide a 
theory of how these properties emerge from the behaviour of electrons. 
Similarly, experimental verification of string theory’s predictions about 
condensed matter will not prove that strings themselves are an accurate 
description of reality. 

But perhaps, Green argues, the connection to materials will show that 
people have fundamentally misunderstood what string theory is. “Maybe 
string theory is not a unique theory of reality, but something deeper —a 
set of mathematical principles that can be used to relate all physical theo- 
ries,” says Green. “Maybe string theory is the new calculus.” m 


Zeeya Merali is a freelance writer based in London 
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Scientists are hoping to stall plans to erect a string of dams along the Mekong River. 


r | This summer, a crew of strangers arrived 
in the tiny village of Pak Lan along the 
Mekong River in northern Laos. They 

sat around in shorts, examining technical draw- 

ings, and then surveyed the area, measuring the 
height of the riverbank, the size of the rice pad- 
dies and even the number of pigs. 

The tally is necessary because Pak Lan may 
soon disappear. The government will need to 
move it and 18 nearby villages, because they 
will be partially or fully submerged if a highly 
controversial dam, called the Xayaburi, is built. 
The US$3.5-billion project will create a 60-kil- 
ometre-long reservoir and generate 1260 meg- 
awatts of power, which will earn between 
$3 billion and $4 billion a year for the developer, 
CH Karnchang Public Company of Thailand. 

Somchit Tivalak, village chief and repre- 
sentative of the ruling communist Lao Peo- 
ple’s Representative Party, is not quite sure 
what a hydroelectric dam is or how it will 
work, but he is convinced that good things 
are on the horizon. He says that his village 
will move to a place where it will have roads 
and electricity, as well as a reservoir teeming 
with fish. 

Many others, however, are deeply worried. 
The lower Mekong, which winds through Laos, 
Thailand, Cambodia and Vietnam, is one of 
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the last big untamed rivers in the world. Nearly 
60 million people depend on its rich fisheries 
for their survival. If the Xayaburi dam is built, 
it will set a precedent for 10 other hydropower 
dams proposed for the main stem of the river. If 
all those proceed, nearly 55% of the river will be 
converted to slow-flowing reservoirs. 

Predicting the effects of such massive 
changes is impossible because the Mekong is 
one of the most poorly studied major rivers in 
the world. Taxonomists know so little about 
the fish there that they are discovering new 
species at an unparalleled pace. And govern- 
ments do not consistently monitor water and 
sediment flows along the river. 

In the case of the proposed Xayaburi dam, 
some scientists say the environmental impact 
assessment (EIA) conducted for the builder is 
seriously flawed because it does not consider the 
wider effects of the dam. “The EIA of the Xay- 
aburi dam is the worst EIA that I've ever seen,” 
says Ian Baird, a professor of geography at Uni- 
versity of Wisconsin—Madison who has studied 
the region for decades. 

Cambodia and Viet- 
nam, which researchers 
say will receive a dis- 
proportionate share of 
the harm from the dam, 


For links to original 
documents, see 
online story : 


Damming the Mekong River could harm millions 
who rely on migratory fish, such as the fishermen 
in this floating village in Kandal, Cambodia. 


have both objected to it. And a scientific panel 
hired by the Mekong River Commission — a 
regulatory body made up of government repre- 
sentatives from Thailand, Laos, Cambodia and 
Vietnam — last year recommended a 10-year 
delay on damming the river so that researchers 
could gather the needed data. But the Laotian 
government, which will receive up to 30% of 
the revenue, says that it will push ahead. 

So scientists are rushing to assess how the 
dams will affect the Mekong’s fisheries and the 
flow of sediment that helps to sustain its vast 
delta. “The problem is, dams are coming very 
fast and are going to deeply modify the envi- 
ronment in a very short time frame,’ says Eric 
Baran, fisheries researcher at the World Fish 
Center in Phnom Penh. “And the countries are 
not equipped to deal with that yet” 


CALMING THE WATERS 

From its origins in the Tibetan plateau, the 
Mekong winds 4,800 kilometres down to the 
South China Sea, making it the longest river 
in Southeast Asia. At least 781 species of fresh- 
water fish ply its waters, including four of the 
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TAMING A RIVER 


Chinese dams along the upper Mekong River have enabled countries 
downriver to plan for a series of power-generating dams. Xayaburi in 


Laos is the first one to near construction. 
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largest freshwater fish species in the world. The 
biggest of them, the endangered Mekong giant 
catfish (Pangasianodon gigas), can grow to be 
as long asa car. 

Years of war, lack of investment, and 
drastic variations in flow between the wet and 
dry seasons have held back hydropower devel- 
opment and helped to keep the lower reaches 
of the river wild. But in the 1990s, Chinese 
engineers began a project to build eight dams 
and reservoirs on the Upper Mekong, which 
have evened out the flow (see “Taming a river’). 

With the Mekong suitably subdued and 
with the demand for electricity rising in the 
region, the Laotian government and private 
developers are now racing to put up dams, and 
Xayaburi is the first one to near construction. If 
itis completed, eight more are likely to spring 
up in Laos and along its border with Thailand, 
according to International Rivers, an environ- 
mental non-governmental organization based 
in Berkeley, California. 

The EIA found that Xayaburi’s effect on fish, 
water flow and erosion would be minimal. 
Instead of creating a large standing reservoir 
behind a massive concrete wall, Xayaburi will 
have a smaller wall that will allow water to pass 
beneath it in what is called a run-of-the-river 
design. According to the EIA, the dam will 
“improve the overall natural fish production 
capacity on the Mekong River in the project 
area, especially in the dry season’. 


But researchers have challenged that con- 
clusion, noting that Xayaburi and most of 
the dams proposed for the river's main stem 
will have concrete walls tall enough to raise 
upstream water levels by between 30 and 65 
metres. Although smaller than conventional 
reservoir dams, the walls would still block 
sedimentand migrating fish, says Tarek Ketelsen, a 
hydrologist at the International Centre for 
Environmental Management in Hanoi, 
Vietnam, which evaluated the Xayaburi EIA. 

Critics also object to the fact that the EIA 
considers the potential effects only for a “down- 
stream area about 10 kilometres from the bar- 
rage site’, according to the document. That 
is a remarkably small stretch of the river, say 
researchers. The EIA was conducted for Karn- 
chang by TEAM Group of Companies, a con- 
glomerate of consulting firms based in Bangkok. 
When contacted by Nature, TEAM said it could 
not discuss the EIA because of the terms of its 
contract with Karnchang. Karnchang did not 
respond to calls or e-mails requesting comment. 


FISHING FOR TROUBLE 

Toun Neang, 52, gets up at 4 a.m. every day to go 
fishing on the Tonlé Sap Lake, which connects 
to the Mekong River in Cambodia. When he 
arrives, he offers incense, rice and beer to the 
spirit in the river. “If we forgot to ask permission 
or make an offer, that day we will not be able to 
catch even a single fish,” he says. 
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A fisherman since childhood, Neang has 
a keen eye for the migration cycles that bring 
fish into and out of the lake from the Mekong. 
Adult fish lay eggs far upstream, and then flood- 
ing during the rainy season brings those eggs 
and juveniles to the Tonlé Sap, he says. He wor- 
ries about dams. “If the water is blocked, how 
can fish migrate downstream? And how can 
fishermen like us live if there are no more fish?” 

The future of the fishery matters because the 
Tonlé Sap — one of the world’s most productive 
inland fisheries for its size — provides half of the 
protein consumed in Cambodia. “It is hard for 
people in Europe or North America to imagine 
the role that freshwater capture plays in terms of 
food security, economically and even culturally,” 
says Kirk Winemiller, a fisheries researcher at 
Texas A&M University in College Station. 

Modelling the effect of Xayaburi and other 
dams on this fishery is difficult because 
researchers lack baseline data about most 
fish in the Mekong. Around 229 species live 
upstream of the proposed Xayaburi site, and 
70 of them are migratory. In terms of biomass, 
about 60% of the total catch in the Tonlé Sap is 
made up of species that migrate long distances, 
some from as far up as the Xayaburi area, more 
than 1,500 kilometres upstream. 

Many dams have built-in fish ladders 
that allow some migrating fish to pass. But 
researchers say the two ladders in Xayaburi’s 
design are not enough for the number of fish 
and the diversity of migratory species there. 

Among them is the Mekong giant catfish, 
the river's best-studied species and longest- 
distance swimmer. Zeb Hogan, fisheries 
researcher at the University of Nevada, Reno, 
spent years collecting fish and extracting cal- 
cified ear bones, called otoliths, from their 
heads. The otoliths grow a new layer each day, 
incorporating elements from the water, which 
creates a chemical record ofa fish's travels. 

Otolith studies have shown, for example, that 
the tropical Asian catfish Pangasius krempfi 
makes an epic migration (Z. Hogan et al. 
J. Fish Biol. 71, 818-832; 2007). It starts life in 
the higher reaches of the Mekong, then drifts 
down to the coastal flood plains during the 
monsoon season. Adult fish live in the brack- 
ish waters of the delta and the South China Sea, 
but they fight their way back upstream to spawn 
at the beginning of the rainy season every year. 

Michio Fukushima, a fisheries scientist 
at the National Institute for Environmental 
Studies in Tsukuba, Japan, and his colleagues 
at Ubon Ratchathani University in Thailand 
are trying to adapt otolith analysis to other spe- 
cies that migrate within the Mekong. Many of 
these species are commercially important, par- 
ticularly the Siamese mud carp (genus Heni- 
corhynchus), known as trey riel in Cambodia. 
This 15-centimetre-long fish is a major food 
source for larger carnivores. It is an impor- 
tant ingredient in fish paste and in feed used 
in aquaculture, and it is the most-harvested 
species in the Mekong. 


GARRISON PHOTOGRAPHIC & MRC; M. FUKUSHIMA; EL ALI/FOCUS/EYEVINE 


FEATURE | NEWS 


Left, fisherman on the Tonlé Sap haul in trey riel; centre, rings in a fish's ear bone record its Mekong travels; right, road construction near the Xayaburi dam site. 


Fukushima’ work has so far traced some of 
the riels migration routes. The fish he captured 
from the Songkhram, a Mekong tributary in 
Thailand, seem to mature in the main stem of 
the Mekong before returning to the tributary. 

Baird says that the riel may become threat- 
ened in the Mekong as dams are built. “You 
start putting dams along the river there, it will 
stop migration for the fish,” he says. “It is hard 
to say exactly — will it wipe it out all together 
or reduce it in number? We havent faced this 
situation with such a highly abundant species.” 

Even less is known about other fish in the 
Mekong. Fukushima and Baran are now creat- 
ing an atlas of fish distribution, and Baran and 
others are modelling the effects of dams on 
fisheries. Preliminary runs suggest that if all the 
proposed main-stem dams are built, the region's 
annual catch of 2.1 million tonnes will drop by 
somewhere between 600,000 and 1.4 million 


tonnes. “Six hundred thousand tonnes repre- 
sents the whole annual freshwater fish produc- 
tion in West Africa’, says Baran. “That's huge.” 


SEDIMENTAL JOURNEY 
The proposed dams will also exacerbate 
the Mekong Delta's ongoing battles with the 
sea. The delta, home to 17 million people in 
Vietnam and 2.4 million in Cambodia, seems 
to be losing coastal land, says James Syvitski, 
a geologist at the University of Colorado at 
Boulder. 

Sea levels there are rising by 6 millimetres 
a year because of a combination of global 
ocean swelling and local changes. And the 
destruction of mangrove forests has left 
the delta prone to devastating floods and 


typhoons. In a study of deltas around the 
world, Syvitski and his colleagues declared the 
Mekong Delta “in peril”, noting that an area 
of nearly 21,000 square kilometres is already 
less than two metres above sea level (J. S. et al. 
Nature Geosci. 2, 681-686; 2009). 

The proposed dams are projected to 
accelerate the sinking by blocking the flow of 
sediment that would otherwise nourish the 
flood plains and build up the delta. Mathias 
Kondolf, a fluvial geomorphologist at the 
University of California, Berkeley, estimates that 
the dams in China and on the lower Mekong 
will block about half of the river’s sediment, 
which could be disastrous for the delta. 

Some dam designs reduce the problem by 
incorporating wide, low-lying outlets that 
allow sediment to pass through. But these can 
compromise power generation, and might 
not let through the heavy sediment loads that 


would accumulate far upstream near the start 
ofa 60-kilometre-long reservoir, says Ketelsen. 

All these unknowns explain why the team 
of consultants assembled by the Mekong 
River Commission (MRC) last year called for 
a 10-year delay in building the Xayaburi dam, 
recommending that Laos start with smaller 
dams on tributaries. The MRC did not take 
a stand on the proposed moratorium and 
would not have the power to enforce it. But 
scientists say that the MRC does have the clout 
to influence the design of dams. 

“Hydropower is important for the develop- 
ment of a country like Laos, and it does have 
a right to develop,” says Ketelsen. “However, 
when it comes to a river like this, which has a 
global significance in terms of biodiversity, you 


don't have to start with the most high-impact 
projects”. The idea of waiting has gained some 
international support. The Asian Development 
Bankin Manila, for example, says that building 
dams on the main stem of the Mekong is pre- 
mature because too little is known about the 
environmental and social costs. 

Last April, Laotian authorities agreed to delay 
construction of Xayaburi until after conducting 
another project review, the results of which are 
due to be submitted to the four nations of the 
MRC ina final meeting in the next few weeks. 
But officials have said recently in media reports 
that they have completed the review and plan 
to go ahead with construction. The MRC is 
keeping silent, waiting for the Mekong nations 
to meet. Near the Xayaburi site in northern 
Laos, it does not look as though construction 
crews are waiting for the final meeting. Trucks 
are paving mud roads with asphalt — a 
necessary first step towards dam construction. 

Downriver in Ubon Ratchathani, Thailand, 
Fukushima gets on a speedboat to collect fish, 
water and sediment samples from a dam ona 
Mekong tributary. For the past two years, he 
has travelled through Cambodia, Laos and 
Thailand by boat and by motorcycle twice a 
year to collect data. When he captures a fish, he 
performs a rough surgery, slicing open its head 
to extract otoliths for later analysis in his lab. 

Fukushima says that before the dams 
become a reality, he wants to establish a base- 
line of environmental and ecological condi- 
tions and to try to work with developers so 
that future dams will cause the least amount 
of harm. He remains cautiously hopeful that 
science can make a difference. Looking out 
over the water, he says, “there must be some 
way we can move towards a better future”. m 


Gayathri Vaidyanathan is a reporter with 
Greenwire in Washington DC. She was 
previously an International Development 
Research Center fellow at Nature. 
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Taming the devil within us 


We are getting smarter, and asa result the world is becoming 
a more peaceful place, says Steven Pinker. 


cc he twentieth century was the 
[Posies in history.” This fre- 
quently asserted claim is popular 

among the romantic, the religious, the nos- 
talgic and the cynical. They use it to impugn 
a range of ideas that flourished in that cen- 
tury, including science, reason, secularism, 
Darwinism and the ideal of progress. But this 
historical factoid is rarely backed up by num- 
bers, and it is almost certainly an illusion. We 
are prone to think that modern life is more 
violent because historical records from 
recent eras are more complete, and 
because the human mind overestimates the 
frequency of vivid, memorable events. We 
also care more about violence today. Ancient 
histories are filled with glorious conquests 
that today would be classified as genocide, 


and the leaders known to history as So-and- 
So the Great would today be prosecuted as 
war criminals. 

Attempts to quantify the death tolls from 
earlier centuries suggest that many of the col- 
lapsing empires, conquering maniacs, horse- 
tribe invasions, slave trades and annihilations 
of native peoples had individual death tolls 
that, adjusted for population, are comparable 
to those of each of the two world wars. War 
before civilization was even bloodier. Foren- 
sic archaeology and ethnographic demogra- 
phy suggest that around 15% of people living 
in non-state societies died violently — five 
times the proportion of violent deaths in the 
twentieth century from war, genocide and 
man-made famines combined. 

Moreover, a century has 100 years, not 
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just 50, and the second half of the twentieth 
century has astonished military historians 
with its unprecedented avoidance of wars 
between developed states and between great 
powers. Civil wars did proliferate in the years 
after the two world wars, but civil wars tend 
to be less destructive than interstate wars, 
and they, too, have declined in number and 
in death tolls. There were far fewer deaths in 
war in the first decade of the twenty-first cen- 
tury than any of the five decades preceding it. 

The obsolescence of major war is just 
one of many historical declines of violence. 
European homicide rates have dropped at 
least 30-fold since the Middle Ages, from 
about 40 per 100,000 people per year in the 
fourteenth century to 1.3 at the end of the 
twentieth. Barbaric customs that were > 
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> unexceptional for millennia, such as 
human sacrifice, the persecution of witches 
and heretics, chattel slavery, blood sports, 
punitive torture and mutilation, sadistic 
executions (burning, breaking, crucifixion, 
disembowelling, impalement) and execution 
for victimless crimes have been abolished in 
most of the world. The past 50 years have 
seen a cascade of rights revolutions — civil, 
women’s, children’s, gay, animal — which 
have demonstrably driven down rates of 
lynching, pogroms, rape, spousal abuse, 
child abuse, spanking, gay-bashing, hunt- 
ing and callousness to laboratory animals. 

The historical decline of violence (see 
“Murder’s downfall’) is a challenging scien- 
tific puzzle for anyone interested in human 
nature. Violence is not just a cultural fad that 
is falling out of fashion, like bustles and spats. 
Aggressive behaviour is found throughout 
the history and prehistory of our species, and 
shows no signs of having been invented in one 
place and spread to the others. The human 
brain has conserved mammalian circuits 
for rage and dominance, and boys univer- 
sally play-fight. A majority of adults have 
homicidal fantasies and enjoy violent enter- 
tainment, and variation in violent tendencies 
across individuals is substantially heritable. 

At the same time, not a single category of 
violence has stayed at a fixed level over the 
course of history. Whatever causes violence, 
it is not a perennial urge like hunger, sex or 
sleep. 


THE BETTER ANGELS 

So what has caused the drop in violence? 
Little if any of the decline can be explained 
by natural selection. Biological evolution 
has a speed limit measured in generations, 
and many of the declines have unfolded over 
decades or years. 

The most promising explanation, I 
believe, is that the components of the 
human mind that inhibit violence — what 
Abraham Lincoln called “the better angels 
of our nature” — have become increasingly 
engaged. 

Today, the most famous of the better 
angels is empathy. It is being studied in chil- 
dren, chimpanzees, undergraduates and 
even single neurons, and has been lauded in 
bestselling books as the solution to human- 
ity’s problems. Indeed, an expansion of 
empathy — fostered by literacy, travel and 
cosmopolitanism — helps to explain why 
people today abjure cruel punishments and 
care more about the human costs of war. 

But empathy cannot be the whole story. 
Contrary to the popular notion that mirror 
neurons make primates reflexively empathic, 
empathy is a fickle emotion. It is triggered 
by cuteness, good looks, kinship, friendship, 
similarity and solidarity. And it is easily shut 
off or converted to its opposite, schaden- 
freude, by competition or revenge. 


The moral sense — another hot research 
topic in psychology — is also less peaceable 
than one might think. No society defines 
virtue solely by the avoidance of harm. Moral 
intuitions also emerge from concerns such as 
betraying a coalition, contaminating oneself 
or one’s community, and defying or insulting 
an authority. Because people feel that moral 
infractions are legitimately punishable, an 
expansive definition of morality provides an 
expansive list of offences for which the sinner 
can be violently punished, including homo- 
sexuality, licentiousness, blasphemy, heresy, 
indecency and desecration of sacred symbols. 

Indeed, because morality furnishes 
people with motives for violent acts that bring 
them no tangible benefit, it is more often the 
problem than the solution. If you added up 
all the homicides committed in pursuit of 
rough justice, the casualties of religious and 
revolutionary wars, the people executed for 
victimless crimes and the eggs broken in 
genocides to make utopian omelettes, they 
would surely outnumber the fatalities from 
amoral predation and conquest. 

The most important psychological con- 
tributor to the decline of violence over the 
long term may instead be reason: the cogni- 
tive faculties, honed by the exchange of ideas 
through language, that allow us to understand 
the world and negotiate social arrangements. 

Reason, admittedly, seems to have fallen 
on hard times. Popular culture is plumbing 
new depths of dumbness, and political dis- 
course has become a race to the bottom. We 
are living in an era of scientific creationism, 
New Age flimflam, 9/11 conspiracy theories 
and psychic hotlines. 

Even scientists are joining in. Human 
beings are led by their passions, say many 
psychologists, and deploy reason only to 
rationalize gut feelings after the fact. Behav- 
ioural economists exult in showing how 
human behaviour departs from the rational- 
actor theory, and sympathetic journalists 
waste no opportunity to smack the theory 


MURDER’S DOWNFALL 


Homicide was thirty times more frequent in the 
Middle Ages in Europe than it is today, and the 
average in non-state societies (such as tribes in New 
Guinea, Africa and the Americas) was even higher. 
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around. The implication is that because irra- 
tionality is inevitable, we may as well lie back 
and enjoy it. 

But I have come to believe that both the 
pessimistic assessment of the state of reason 
in the world, and any sentiment that this 
would not be such a bad thing, are mistaken. 
For all their foolishness, modern societies 
have been getting smarter, and all things 
being equal, a smarter world is a less violent 
world. 

Why might reason lead to less violence? 
The most obvious pathway is captured 

in French writer 


“A debunking Voltaire’s quip that 
of hogwash “those who can make 
willundermine Y° believe seek 
many ties on re cine 
rationales for commit atrocities”. 


A debunking of 
hogwash — such as 
the beliefs that gods 
demand sacrifices, heretics go to hell, Jews 
poison wells, animals are insensate, Africans 
are brutish and kings rule by divine right — 
will undermine many rationales for violence. 

Reason can also lead people to want less 
violence. This may seem to violate Scot- 
tish philosopher David Hume's dictum that 
“reason is, and ought to be, only the slave of 
the passions”. Reason, by itself, can lay out 
a road map to peace or to war, to tolerance 
or to persecution, depending on what the 
reasoner wants. 


violence.” 


LIFE OVER DEATH 

But, two conditions will tend to align reason 
with nonviolence. The first is that reasoners 
care about their own well-being. They prefer 
to live rather than die, keep their body parts 
intact and spend their days in comfort rather 
than in pain. Logic does not force them to 
have those prejudices. Yet any product of 
natural selection — indeed, any agent that 
has endured the ravages of entropy long 
enough to be reasoning in the first place — 
is likely to have them. 

The second condition is that a reasoner be 
part of a community of reasoners who can 
impinge on their well-being and who can 
comprehend each other’s reasoning. And 
indeed Homo sapiens is not just a rational 
animal but a social and language-using one. 
Self-interest and sociality combine with 
reason to lay out a morality in which non- 
violence is a goal. If one agent says, “It’s bad 
for you to hurt me’, he has also committed 
to “It’s bad for me to hurt you’, because logic 
cannot tell the difference between ‘me’ and 
‘you. Therefore as soon as you try to per- 
suade someone to avoid harming you by 
appealing to reasons why he shouldn't, you're 
sucked into a commitment to the avoidance 
of harm as a general goal. 

Humans, of course, were not created in 
a state of original reason. We descended 


from ape-like primates, spent hundreds of 
millennia in small bands and evolved our 
cognitive processes in the service of forag- 
ing and socializing. Only gradually, with 
the appearance of literacy, cities and long- 
distance travel and communication, could 
our ancestors cultivate their reason and 
apply it to a broader range of concerns. As 
collective rationality is honed over the ages, 
it will clamp down on short-sighted and 
hot-blooded impulses towards violence, and 
force us to treat a greater number of agents as 
we would have them treat us. 

To be sure, it remains puzzling that it 
took us so long to figure this out. Why did 
human rationality need thousands of years 
to conclude that something might be a wee 
bit wrong with slavery? Or with beating 
children, raping unattached women, exter- 
minating native peoples, imprisoning homo- 
sexuals or waging wars to assuage the injured 
vanity of kings? 

Perhaps humans have been getting nicer 
because they have been getting smarter. 


GETTING SMARTER ALL THE TIME 

In the early 1980s, the philosopher James 
Flynn at the University of Otago in Dun- 
edin, New Zealand, had a eureka moment 
when he noticed that the companies that sell 
IQ tests periodically renormalize the scores. 
Later generations, given the same set of 
questions as earlier ones, got more of them 
correct. Flynn scoured the world for test 
scores, and the result was the same in every 
sample: IQ scores had increased throughout 
the twentieth century. An average teenager 
today, if he or she could time-travel back to 
1910, would have had an IQ of 130, anda 
typical person of 1910, if time-transported 
forward to the present, would have a mean 
IQ of 70. 

The increase is not in general intelligence, 
the heritable factor underlying all the com- 
ponents of intelligence (such as vocabulary, 
arithmetic and knowledge). It is concen- 
trated in abstract reasoning, such as noting 
similarities (“What do a pound and an inch 
have in common?”) and analogies (“BIRD 
is to EGG as TREE is to what?”). The most 
likely causes are increases in the duration 
and quality of schooling, the spread of 
symbol-manipulation into work and leisure, 
and the trickling down of scientific and ana- 
lytical reasoning into everyday life. 

Could an expansion of reason really have 
driven down violence? Consider the state- 
ments of the great men of a century ago, 
such as Theodore Roosevelt, who wrote: 
“I dont go so far as to think that the only 
good Indians are dead Indians, but I believe 
nine out of ten are, and I shouldn't like to 
inquire too closely into the case of the tenth’, 
or the young Winston Churchill, who cheer- 
fully carried out atrocities in British colonies 
in Asia and Africa and wrote: “I hate Indians. 


Peacekeepers, such as NATO in Kosovo, are highly successful, an example of how reason has tamed violence. 


They are a beastly people with a beastly reli- 
gion.” Today we are stunned by the compart- 
mentalized morality of these men, who in 
many ways were enlightened when it came 
to their own race. Yet they never took the 
mental leap that would have forced them 
to treat other races with the same consid- 
eration. Today’s children have been encour- 
aged to take these cognitive leaps with gentle 
instruction such as “There are bad Indians 
and there are good Indians, just like there 
are bad white people and good white peo- 
ple. You can't tell whether a person is good 
or bad by looking at the colour of his skin’, 
and “Yes, the things those people do look 
funny to us. But the things we do look funny 
to them.” Such lessons are not indoctrina- 
tion but guided reasoning, leading children 
to conclusions they can accept by their own 
standards, and the resulting understanding 
has become second nature. 

Is there any evidence that enhancements in 
thinking can make us less violent? Cognitive 
neuroscience suggests that morality is driven 
not just by the limbic circuits underlying 
emotion but also by parts of the prefrontal 
cortex that underlie abstract thought. And the 
historical record shows that many humane 
advances were initiated in the realm of ideas. 
Philosophers prepared careful briefs against 
slavery, despotism, torture, religious persecu- 
tion, cruelty to animals, harshness to children, 
violence against women and frivolous wars. 
These arguments were disseminated in pam- 
phlets and bestsellers and debated in salons 
and pubs, and then in conventions and legis- 
latures that implemented reforms. 

There are also more direct links between 
reason and peace. On average, and holding 
all else constant, people with greater reason- 
ing abilities commit fewer violent crimes, 
are more likely to cooperate in experimen- 
tal games, and have more classically liberal 


attitudes, such as opposition to racism and 
sexism. And on average, holding all else 
constant, societies with higher levels of 
educational and intellectual achievement 
are more receptive to democracy, and have 
lower rates of civil war. 

Advocates of reason and its gifts, such as 
science, technology and secular democracy, 
should no longer feel that they must be on 
the defensive. The association between 
the best and the worst of the twentieth 
century was always crude, and it is time 
to re-examine it in the light of statistically 
literate history. Almost seven decades after 
the horrors of the first half of the twenti- 
eth century, we now see that they were 
not a new normal or harbingers of worse 
to come, but a local high from which the 
world would bumpily descend. The ideolo- 
gies behind them were atavisms that ended 
up in the dustbin of history, and the ideal 
of universal human rights, which would 
have seemed saccharine or incoherent to 
our ancestors, has become the moral com- 
monplace of our age. 

The forces of reason have not, of course, 
pushed steadily in one direction; nor will 
they ever bring about utopia. But reason has 
done more than enhance our health, experi- 
ence and knowledge — it has, quantifiably, 
made the world a less violent place. m 


Steven Pinker is in the Department 

of Psychology at Harvard University, 
Cambridge, Massachusetts 02138, USA. 
This article is adapted from his new book 
The Better Angels of Our Nature: The 
Decline of Violence in History and its 
Causes (Allen Lane, 2011). 

e-mail: pinker@wjh.harvard.edu 

Further reading accompanies this article online 


at go.nature.com/zwnmmt, and full references 
accompany the book. 
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NATIONAL PSORIASIS FOUNDATION 


A participant’s involvement in research doesn’t have to end the day a sample is collected. 


Treat donors as partners 
in biobank research 


Proposed rules to protect research subjects will impede 
progress, say Krishanu Saha and J. Benjamin Hurlbut. 
Instead, give donors more say in how samples are used. 


r “en years after the human genome 
project, the potential for personalized 
medicine lies not in a single genome 

but in many’. Before physicians can discern 
features of disease that are specific to indi- 
viduals, researchers need to catalogue the 
enormous range of genomic and phenotypic 
variation in human populations. Collec- 
tions of biospecimens that are accompanied 
by data on medical history, behaviour and 
health outcomes are crucial to this task’’. 
If built well, these biobanks will facilitate 
research on health and disease for decades 
to come, but they require significant invest- 
ment up front. This includes getting large 
numbers of people to submit biospecimens 
and personal health information. 

Building such participation has not been 
easy. The US National Cancer Institute’s 
(NCI’s) Office of Biorepositories and Bio- 
specimen Research found in 2009 that 80% 
of researchers have difficulty acquiring qual- 
ity biospecimens, and that this compromises 
research findings’. It is understandable that 
people hesitate to participate, as controversies 


have eroded public trust in scientists who col- 
lect specimens for research. A 2009 lawsuit 
against the Texas Department of State Health 
Services alleged that researchers had violated 
the rights of newborns by not seeking paren- 
tal consent to use their bloodspots in research, 
and led to the destruction of 5 million samples 
(see Nature 475, 156-158; 2011). The 2010 
book The Immortal Life of Henrietta Lacks 
by Rebecca Skloot raised questions about 
researchers’ obligations towards Henrietta 
Lacks, who became an unwitting research 
subject when her cancer cells were used to 
create an important research tool: the HeLa 
cell line. In 2010, Arizona State University in 
Tempe settled a lawsuit stemming from the 
allegedly unauthorized use of blood samples 
from the local Havasupai tribe (see go.nature. 
com/kcuacr for more). 

Partly in response to these controversies, 
and for the first time in 20 years, the United 
States has proposed substantial changes to 
‘the Common Rule, the regulations that gov- 
ern research involving human subjects’. The 
90-day public comment period closes on 
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26 October. This is an opportune moment to 
reconsider the role and rights of participants 
in biomedical research. 

Current practices in managing biobanks 
tend to see the public as little more than a 
resource for mining data and materials, and 
as a potential source of resistance. Participants 
provide information or tissues with little or no 
knowledge of the researchers’ priorities, goals 
or expected outcomes. Barriers are erected. 
Materials and information are ‘de-identified’ 
to protect people's identities. Participants 
neither see how their donations are used, nor 
what the research produces. 

As they stand, the proposed changes to 
the Common Rule’ risk further widening 
the divide between researchers and donors. 
The changes encourage blanket consent 
— asking donors to authorize virtually 
everything, with opt-out checkboxes for 
predefined categories of research that might 
pose “unique concerns’, such as creating a 
cell line’. Scientists could use the samples 
for additional projects without seeking 
re-consent. The changes are intended to 
ensure that the scope of authorized research 
is clear to all parties, thereby circumvent- 
ing the ambiguities that were at the heart of 
the Havasupai case. But this is achieved by 
telling donors next to nothing about how 
and for what purpose their donation will 
be used. Although this may reduce admin- 
istrative complexity, we believe that it will 
decrease, not increase, public involvement 
in biobanks and fail to deliver on its ethical 
aim of better protecting participants rights. 

Most people prefer to have a say in how 
their donations are used. Ina study in which 
donors were asked whether they would 
make their de-identified data available for 
future research, 90% said it was ‘important’ 
that they were asked for permission — yet 
the vast majority (86%) gave their consent’. 

We need an alternative approach, in 
which donors are made partners by stay- 
ing connected to research. Partnership is a 
win-win approach: it will build trust, make 
research better and faster, and generate large, 
diverse cohorts with longitudinal data’. 
By staying connected with participants (for 
example through the Internet), scientists 
can seek additional information and assem- 
ble cohorts for specific studies without the 
administrative hassles of the current system. 


PATHWAYS TO PARTNERSHIP 

Instead of letting failures and controver- 
sies drive reform, we can learn from pro- 
jects that have developed innovative ways 
of connecting participants with research. 
Disease advocacy organizations have con- 
structed biobanks by mobilizing disease 
communities to make rare biospecimens 
available to researchers. They have also 
demanded a more active role in shaping 
biomedical research priorities, offering 
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banked materials to encourage the research 
they care about. The advocacy organizations 
reject the assumption that the asymmetry in 
scientific expertise between researchers and 
participants justifies asymmetry in research 
governance. Working with scientists, these 
organizations have enhanced research: they 
have uncovered genes responsible for Hun- 
tington’s disease and multiple myeloma; 
discovered biomarkers for spinal muscu- 
lar atrophy; and coordinated clinical trials 
for children with ataxia telangiectasia. One 
study’ found that 91% of these organizations 
assist in participant recruitment, 75% col- 
lect data for research, 45% support a regis- 
try or biobank and 56% have assisted with 
study design. Because the rules that gov- 
ern most mainstream biobanks preclude 
these activities, many advocacy organi- 
zations have created their own biobanks 
— such as the Genetic Alliance Biobank, 
which researchers can only access by 
applying to an advocacy organization. 

But disease advocacy organizations are 
primarily motivated to find cures for specific 
diseases, not to bolster general, exploratory 
research. Therefore, resources such as the 
Genetic Alliance Biobank, although valu- 
able, do not achieve an important goal of 
biobanking — to build diverse collections 
in order to reveal factors such as genes and 
biomarkers that cut across disease categories 
or express variably in the same individual 
as disease develops”*. However, compa- 
nies such as PatientsLikeMe in Cambridge, 
Massachusetts, and 23andMe in Mountain 
View, California, demonstrate that indi- 
viduals who lack the strong motivation and 
resources of disease advocates will neverthe- 
less participate in exploratory biobanking, if 
the terms are right. 

Both companies offer benefits to indi- 
viduals — personalized genomic informa- 
tion in the case of 23andMe and insights 
into disease conditions with PatientsLikeMe 
— while building collective resources for 
research. Both biobanks have developed 


PEOPLE POWER 


The biobanks of two companies have grown 
rapidly by using the web to partner with 
participants. 
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rapidly and with remarkably low over- 
heads by using the web (see ‘People power’). 
PatientsLikeMe has built an extensive repos- 
itory of detailed phenotypic and health data, 
crowd-sourced from a community of more 
than 115,000 participants. 23andMe now 
holds biosamples and genomic data from 
in excess of 120,000 individuals, more than 
87% of whom have opted to participate 
in research. These biobanks are powerful 
tools. Researchers can assemble cohorts 
in hours rather than months, and rapidly 
acquire follow-on data from participants 
with minimal administrative hassle. When 
a large multi-year study funded by the US 
National Institutes of Health revealed a gene 
related to Parkinson's disease, 23andMe 
nimbly replicated the finding within an 
hour, using genomic data it already had 
in its then much smaller 30,000-per- 
son database, which included more than 
3,000 people with Parkinson’s*. 

Through a web portal, participants control 
the information they provide, see how it is 
put to use and are regularly updated on what 
new findings mean for them. These compa- 
nies have achieved trust, while still protecting 
privacy. The result is that participants have 
chosen to give more than the minimum. 
They are rewarded by witnessing scientific 
progress in process — including what new 
knowledge means for them as individuals. 

These companies, like advocacy organiza- 
tions, operate on the fringes of biomedicine 
and are regarded with suspicion by some, 
who doubt the validity of self-reported data 
collected online, and fear the risks of direct- 
to-consumer genetic testing. But the rapid 
growth of the companies’ biobanks suggests 
that they have found a promising way to 
achieve trust while advancing science. 


ABETTER INFRASTRUCTURE 
Rather than encourage a one-size-fits-all 
consent process’, the Common Rule should 
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depend on from coercion — in 
earning the effect, the right to 
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right of participants 
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EVOLUTION 


Lies we tell ourselves 


Stuart West is inspired by Robert Trivers’ evolutionary argument that self-deception 
is crucial to deceiving others effectively. 


ature is filled with exquisite 
| \ | examples of deception. Fireflies 
mimic the flashes of others to attract 
and eat them; birds make fake alarm calls 
to scare competitors away from their lunch; 
male fish mimic females so that they can 
sneak sex while avoiding the wrath of larger 
males. In Deceit and Self-Deception (US title 
The Folly of Fools), Robert Trivers, one of the 
most influential evolutionary biologists since 
Charles Darwin, invites us to look closer to 
home to find a more surprising form of 
deception — one in which we fool ourselves. 
In this provocative and wide-ranging 
book, Trivers examines the evidence that self- 
deception occurs, theorizes why it happens 
and shares the insights that can be gained 
from looking at human behaviour. In discuss- 
ing the science of deception, he doesn't shrink 
from giving his views on areas in which it has 
implications, such as drug addiction. 
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Trivers starts by 
making a clear and 
powerful case that 
conflict — common 
at all levels of inter- 
action, from groups to 
individuals and genes 
— favours deception. 
If an individual can 
mislead others, they 
gain an advantage, 
whether in a compe- 
tition for mates or in 
division of labour. 


4 


TRIVERS 


Deceit and Self- 
Deception: Fooling 
Yourself the Better 
to Fool Others 
ROBERT TRIVERS 
Basic Books/Allen 


A strong theme is Lane: 2011. 
that deception drives 352 pp./320 pp. 
detection mecha- %28/£25 


nisms, which drive 

better deception and so on, in a deceitful 
co-evolutionary dance. So we should not be 
surprised that humans have evolved sophis- 
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ticated methods for producing and detecting 
deception. Trivers argues that deception itself 
may have been an important evolutionary 
force in selecting the large brains and intel- 
ligence that now allow us to study it. 

The book goes beyond the evolution- 
ary literature by arguing that individuals 
can be selected to deceive themselves. The 
idea that we, or any other organism, should 
deceive itself is arresting. Why should we go 
to the trouble of having elaborate sensory 
and neurobiological systems that obtain and 
process information about the environment, 
only to throw away or hide that information 
in exchange for a distorted view? 

Trivers’ theory is that individuals mislead 
themselves because it 
helps them to deceive 
others more convinc- 
ingly. Imagine two 
individuals, human or 


Nature’s web focus 
on being human: 


ILLUSTRATIONS BY ALESSANDRO GOTTARDO 


other species, wrangling over a resource such 
as food, territory or mates. Each displays their 
quality and assesses the other's. The result is 
that the one least likely to win backs down. 
Trivers argues that if one individual thinks 
that they are bigger and stronger than they 
are, they will display as such — and in a way 
that makes them more likely to fool others. 

To examine this idea, Trivers brings 
together neurophysiology, behaviour, immu- 
nology and psychology. Behavioural data 
show that some classes of individual are more 
likely to misjudge factors such as their own 
intelligence or sexual orientation. Neurologi- 
cal data suggest that the conscious mind has 
a positively biased view of self, relative to the 
subconscious mind. Immunological studies 
show that deception has costs to immuno- 
logical function. And psychological studies 
suggest that biased memory, denial and pro- 
jection are common. 

Trivers relies heavily on data obtained by 
methodologies from brain scans to immu- 
nological assays to plethysmographs that 
measure organ volume. He aims to stimulate 
research and debate, raising more questions 
than he answers. How might we model self- 
deception from an evolutionary perspective? 
How do we distinguish the psychology of 
deception from that of self-deception? Could 
we test for it in non-humans? What are the 
competing hypotheses? How do we test the 
costs and benefits? What are the psychologi- 
cal consequences? 

A common theme in his examples is that 
deception has been selected because of its 
benefits at the individual level, but that this 
can lead to disastrous consequences at, say, 
industrial or national level. Trivers offers 
much food for thought on these topics, in 
chapters that are likely to provoke contro- 
versy. He suggests that self-deception has a 
major role in the initiation and justification 
of wars, the development of false historical 
narratives and the existence of religion. 

Deceit and Self-Deception has broad appeal 
and a well-structured narrative. Trivers adds 
numerous anecdotes, such as what to do when 
being searched by the police (look away), or 
why you should avoid walking with him near 
squirrels (in case he makes a sudden lunge, 
provoking an attack). 

Trivers touches on wide-ranging issues: 
the role of evolutionary biology in the social 
sciences; the placebo effect; lie detectors; 
genocide; the scientific method. But he con- 
veys a powerful and focused message: if we 
can learn to recognize and fight our own 
self-deception, we can avoid negative con- 
sequences at levels from the individual to the 
national, and live better lives. = 
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biology in the Department of Zoology, 
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A question of 


INdsSS 


The struggle to find a theory of the weak nuclear force 
reveals intriguing personalities, finds Edwin Cartlidge. 


r Vhe Higgs boson is an expensive 
quarry. Finding this as-yet-unseen 
elementary particle, thought to 

endow others with mass, is the head- 

line aim of the Large Hadron Collider 

— a venture costing billions of dollars 

and involving thousands of scientists at 

CERN, Europe's high-energy physics lab 

near Geneva, Switzerland. Just why the 

Higgs is so significant is laid out in Frank 

Close’s fascinating book The Infinity Puzzle, 

which chronicles the hunt to pin down the 

fundamental forces of nature, and the 
human triumphs and failings along the way. 

Close, a particle physicist, offers a com- 
pelling history and sociology of modern 
particle theory. We discover the motiva- 
tions and achievements of a rich cast of 
brilliant individuals, and get enough of the 
science to grasp what they were trying to do. 
Where Close really shines is in exposing the 
fraught process of recognition in science, 
focusing on key players such as Pakistani 
theoretical physicist Abdus Salam and 
the man after whom the famous boson is 
named, British physicist Peter Higgs. We get 
a feel for what has been called “Nobelitis” — 
the preoccupation with claims to discovery 
that can afflict pioneers in their fields. 

The book’s focus is 
the search, starting in 
the 1950s, for a the- 
ory to describe the 
weak nuclear force, 
which is responsible 
for radioactive decay. 
The challenge was to 
devise a theory that 
was consistent with 
the strange laws of 
quantum mechanics, 
but that did not pre- 
dict absurd infinities 
for the values of some particle properties. 

An analogous problem had arisen when 
physicists tried to describe electromagne- 
tism using a theory known as quantum 
electrodynamics (QED). Julian Schwinger 
and Richard Feynman found a solution 
to it in the late 1940s; they bypassed the 
infinities by calculating the electron’s 


IT 1S HARD TO SEE 
HOW THE HIGGS 
DISCOVERY ... WILL BE 
ABLE TO TELLUS 


WHY WE ARE 
HERE. 


properties relative 
to known values of 
its charge and mass 
(a trick known as 
‘renormalization. 
Unfortunately, no 
such mathematical 
feat was evident in 
the more difficult 
The InfinityPuzzle cases of the weak 
FRANK CLOSE force or the strong 
Oxford University * 
Press: 2011. 416 pp. nuclear force, which 
£16.99, $28.99 holds protons and 
neutrons together in 
atomic nuclei. 

In 1954, Chen-Ning (Frank) Yang and 
Robert Mills used the mathematics of 
group theory to put forward a description 
of the strong force that was analogous to 
QED. Later, their work would form the 
basis of successful theories for both the 
strong and weak forces. But it was dis- 
missed at the time because it predicted 
that particles transmitting the force have 
electric charge but no mass — particles 
that are not found in nature. 

The solution to the quandary of these 
massless particles, it turned out, was ‘sym- 
metry breaking. According to this idea, 
all particles were 
equal and massless 
in the very hot early 
Universe, and it was 
only as the Uni- 
verse expanded and 
cooled that this sym- 
metry broke down 
and particles with 
different masses 
condensed out. 
Sheldon Glashow, 
Steven Weinberg 
and Salam were 
awarded the 1979 Nobel Prize in Physics 
for incorporating symmetry breaking into 
a description of the weak force, thus creat- 
ing the ‘electroweak theory that united the 
weak and the electromagnetic forces. In 
1999, Dutch physicists Gerardus ‘t Hooft 
and Martinus Veltman won the Nobel for 
proving that the electroweak theory > 
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>» could be renormalized. It is now hoped 
that the Large Hadron Collider will show 
whether particle masses in nature really do 
arise because of symmetry breaking, and 
whether the Higgs or some more exotic 
mechanism is responsible. 

Close points out that the decades-long 
search for a theory of the weak force is full 
of “wrong turns, partial answers, and mislaid 
arguments’, with a liberal dash of Nobelitis. 
Glashow, sensing the prize was close, had 
an “acute” bout of it towards the end of the 
1970s, whereas Salam had a “chronic version 
spanning many years”. Close asks whether 
Salam — who, he suggests, seemed to recog- 
nize the importance of symmetry breaking 
in the context of the weak force only after 
‘t Hooft’s work was published — was as 
deserving ofa share of the prize as the others. 
He also asks whether Salam’s long-time 
collaborator, British physicist John Ward, 
deserved greater credit. According to 


Close, Salam and Ward had a lesser claim 
than the others. He argues that Glashow 
deserves the credit for first laying out the 
basic theory; Weinberg for incorporating 
symmetry breaking; and later ‘t Hooft and 
Veltman for putting the theory on a solid 
mathematical footing. 

Other stories emerge from this mine- 
field. We learn how Salam’s colleague Ron- 
ald Shaw arrived first at a theory like that of 
Yang and Mills but didn’t publish. Regarding 
Higgs, Close does not say whether he would 
have a bigger claim to a Nobel than the five 
other theorists who published papers on 
symmetry breaking at around the same 
time. But he does point out that Higgs was 
the only one who explicitly refers to a mass- 
giving boson in his 


paper — and hence DNATURE.COM 
deserves to have his _ Frank Close reviews 
name attached to it. lan Sample’s book on 
Close’s history of _ the Higgs boson: 
the field is engaging _ go.tiafure.com/ewtcuy 
What Technology Wants 
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Kevin Kelly (Penguin, 2011; $17) 

The founding executive editor of Wired magazine, Kevin Kelly, looks at the 
relationship between humans and technology in a book described by reviewer 
Zaheer Baber as “original and timely” (Nature 468, 372-373; 2010). Kelly 
argues that the increasing complexity of technology means that we have less 
control over how mechanical systems evolve. Although this “technium” is 
neither alive nor sentient, Kelly warns that it may develop its own wants. 


and gives insight into how great theories 
are created. So it is regrettable that on the 
last page, the author inserts unnecessary 
hyperbole, claiming that discovering the 
Higgs boson may allow us to answer a ques- 
tion that has so far been asked only by phi- 
losophers — why is there something rather 
than nothing? 

It is hard to see how the Higgs discovery, 
or any other breakthrough in physics, will 
be able to tell us why we are here, and why 
the laws of physics are the way they are. But 
it could, as Close says, help us to under- 
stand why nature’s particles and forces 
have the properties that they do. Although 
less grand than explaining our existence, 
this knowledge would be a handsome 
reward for the time and money spent 
battling infinities. = 


Edwin Cartlidge is a science writer based 
in Rome. 
e-mail: edwin.cartlidge@yahoo.com 
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Gaming, guns and virtual gold 


John Gilbey savours a fast-paced technological thriller based at the intersection 
of the real world and the cyber-universe. 


conference in Silicon Valley, California. 

The author was sitting in a melee of 
scientists and technologists, but seemed 
oddly detached. If I had gone closer, I 
suspect I might have seen a flashing red 
‘Record light in the corner of one eye as he 
absorbed his surroundings for future use. 

Superb attention to detail is one defining 
characteristic of Stephenson's latest novel, 
REAMDE. A thousand-page epic, the book 
hinges on the computer virus of the title, 
but offers a panorama of developing cyber- 
economies, online gaming, international 
terrorism and the flow of economic power 
from one culture to another. Stephenson 
frames the story as a series of intersections 
between the lives of techno-folk and the rest 
of the world, building a morass of foul-ups 
and misunderstandings that rings true. 

The novel centres on 50-something 
loner Richard Forthrast, head of a 
successful online-gaming company 
called Corporation 9592. Already 
rich from a youth misspent back- 
packing drugs into the United 
States from Canada, Forthrast is 
an aficionado of online gaming — a 
passion that has led him to create T’Rain, 
now the dominant online role-play game of 
the age. 

The REAMDE virus is designed to trap 
users of T’Rain by professional — if naive 
— hackers based in China. Once it infects 
a computer, the virus (with a nod to real 
examples) encrypts stored documents and 
demands a ransom. REAMDE adds a twist. 
The ransom must be paid in virtual gold 
within the game, from where it is collected 
by the virus writers in a form of ‘gold-farm- 
ing’ — a means of working within games to 
convert virtual assets into actual cash value. 

The scale of the infection disturbs the 
balance of the game: once word of huge 


I almost met Neal Stephenson once, at a 


caches of gold gets out, every bandit and 
warlord of the virtual world seeks to make 
it his or her own — by force. 

The degree of action in this multi-threaded 
plot puts most Hollywood blockbusters in 
the shade. Stephenson's narrative moves 
seamlessly between real and virtual worlds 
with impressive pace, and his descriptions 
of technology are accurate and believ- 
able. Aside from a couple of non-catalogue 
components introduced by an MI6 agent, 
there are no ‘magic boxes here. The novel is 
peopled with well-drawn characters such 
as Zula, Forthrast’s software-engineer niece 

whose abduction sets the plot 


running, and Csongor, a Hungarian security 
specialist forced to survive on his wits. Add 
dangerously intelligent terrorists, darkly sinis- 
ter state-security forces, grimy travel alternat- 
ing with the height of luxury, and you find a 
novel in which a largely sympathetic cast of 


characters is thrown 
in at the deep end of an 
extreme environment. 

Unlike much of 
Stephenson’s writ- 
ing, REAMDE is set 
in the present. But it 
shares many aspects 
of his previous work, 
such as the highly 
regarded future-based 
novel Snow Crash: a 
strong-willed young 
heroine, dramatic and 
cinematic action, and 
lots of technology. His 
writing style has developed markedly since 
his early novels, in which gleeful descriptions 
of violent mayhem occasionally verged on 
the cartoonish. The opening sequence of 
REAMDE, for example, describes a 

fraught family reunion in rural Iowa 

— with various family members 
and hangers-on seeking approval 
for their shooting skills. The group's 
strong shared values and equally deep 
divisions come across powerfully in a 
portrayal that is reminiscent of Annie Proulx 
or, to a degree, Garrison Keillor. 

There is nothing routine about this 
masterly techno-thriller. REAMDE mixes 
elements of current concern in technology 
and in society. It makes clear how far we 
have moved towards a single global com- 
munications, entertainment and business 
environment, while still being bound by 
our own local cultural heritage. Stephenson 
demonstrates just how explosive that com- 
bination can be. m 


REAMDE 

NEAL STEPHENSON 
Atlantic Books/William 
Morrow: 2011. 

912 pp./1056 pp. 
£18.99/$35 


John Gilbey is in the Department of 
Computer Science, Aberystwyth University, 
Aberystwyth SY23 3DB, UK. 

e-mail: gilbey@bcs.org.uk 


The Moral Landscape: How Science Can — 
Determine Human Values 

Sam Harris (Free Press, 2011; $15) 

Journalist Sam Harris argues that science can help 
us to understand moral values. Harris “eloquently 
counters the jaded pessimists who think that 
science has little to say about happiness”, wrote 
reviewer Pascal Boyer (Nature 469, 297; 2011). 


Chasing the Sun: The Epic Story of the Star 
That Gives Us Life 

Richard Cohen (Simon & Schuster, 2011; £8.99) 
Richard Cohen examines why our star is studied 
and even worshipped. Douglas Gough noted that 
the book “paints a fascinating and far-reaching 
scene that incorporates nearly all aspects of solar 
phenomena” (Nature 468, 504-506; 2010). 
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ex and freedom 


W. F. Bynum applauds the life of a great educator who 


fought for parental choice. 


ampaigner and educator Margaret 
( Sanger was passionate: about life, her 

cause — the right of women to limit the 
size of their family — and her many lovers. 
She lived her long life in the fast lane, dying in 
1966 aged 87. Historian Jean Baker’s account 
gives all that verve its due. 

Sanger’s rise to international prominence 
came against the odds. Born to a large Irish 
Catholic working-class family in upstate New 
York, she had neither educational nor cultural 
advantages. Her mother died of tuberculosis, 
and Sanger, too, had the disease, although 
apparently in her tonsils, not her lungs. Med- 
ical school was beyond her means, so she 
trained as a nurse but never completed the 
course. Neither of her two marriages was con- 
ventional: she hated domesticity and was fre- 
quently away from home. Her three children 
were often cared for by others, although she 
did return home to hold her dying daughter. 

Sanger devoted her life to the cause for 
which she is still remembered by many, and 
vilified by some: birth control. She grew up 
knowing at first hand what the rigours of fre- 
quent childbearing with little income took 
out of women, and saw many more examples 
in her brief career as a district nurse in New 
York City before the First World War. 

She and her first husband, architect Wil- 
liam Sanger, moved in bohemian, socialist 
and even anarchist circles. Women’s right to 
vote was the most pressing issue among femi- 
nists of the time, although many of Sanger’s 
radical friends set suffragist struggles within 
the larger social and political context. In her 
world view, all these movements paled before 
the central one: the liberation that separating 
sexuality from repro- 


duction could give to OD NATURE.COM 

women in particular, Michael Sargent on 

and society in general. _ population control: 
Sanger faced  go.nature.com/tvesex 


formidable problems. 
Contraception was not 
only taboo, it was ille- 
gal. Under-the-counter 
methods — foams, 
caps, pessaries and 
douches — were unre- 
liable and cumber- 
some, although more 


MARGARET | 
= SANGER| 
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effective than nothing. beg ape ael abal 
The condom, which JEAN H. BAKER 


was much improved pil! & Wang: 2011. 
during Sanger’s early 368 pp. $35 
career, was officially 
used only to prevent venereal disease. Absti- 
nence was hardly an easy option, and the 
‘rhythm method; which Sanger distrusted, 
was hit-and-miss. Knowledge of the female 
reproductive cycle was still rudimentary. 
Despite these difficulties, Sanger believed 
that women deserved help in limiting their 
family sizes. From a small campaign to open 
aclinic to the international organizations that 
she helped to create, the legality and availabil- 
ity of birth control were her abiding passions. 
This brought her into confrontation with 
the law (including time in jail), the Catholic 
Church and politicians. She also had frequent 
contact with physicians, and the peculiarities 
of US society in the early twentieth century 
meant that she needed them to run her clinics. 
Physicians were exempted from some 
of the strictures of the punitive Comstock 
laws, named after Anthony Comstock 
(1844-1915), an ardent puritan who per- 
suaded Congress to pass draconian prohibi- 
tions on all manner of things that he deemed 
obscene. Comstock clashed with Sanger and 
many others over contraceptive devices, 
which fell within his remit as a postal 
inspector; it was an offence to send anything 
obscene through the postal service. Com- 
stock went on to inspire the youthful J. Edgar 


Hoover, ensuring that the FBI continued the 
vigilance. So the gaps in the Comstock laws, 
and Sanger’s own pragmatic attitude to her 
cause, meant that birth control was medical- 
ized almost from the beginning. 

Baker elegantly recounts Sanger’s single- 
minded pursuit of her obsession. Her account 
reminds us that there has long been a litigious 
element in US society. Substitute birth control 
with the terms stem-cell research or abortion, 
and a lot of what Baker says is uncomfortably 
topical. To her credit, she never preaches, even 
if her own modernist liberal stance is clear. 
Instead, she reveals a portrait of a complex 
woman whose heart was in the right place. 

By sheer dint of will, Sanger became a 
woman of international standing and an 
inspiration to both sexes. Her vision of birth 
control became mixed with eugenic con- 
cerns — common among many reformers of 
her period — but she never lost her abiding 
concern for the plight of individual women. 

One could write a psychobiography of 
Margaret Sanger. She was often called a hys- 
teric; her granddaughter described her as a 
nymphomaniac. Her love life, involving the 
writer H. G. Wells and several of her lawyers 
and assistants, was colourful and crowded. 
Yet such a portrait would miss the point of 
Sanger’s achievement, and Baker is wise to 
rely on the evidence to reveal her subject. 

Sanger had an attractive personality and 
many friends. This was fortunate: she needed 
the rich to support her and her cause. Among 
her many rich allies was Katharine McCor- 
mick, a biology graduate of the Massachu- 
setts Institute of Technology in Cambridge, 
whose husband had a serious mental disorder, 
probably schizophrenia. McCormick devoted 
much of their fortune to medical research, 
and it was her friendship with Sanger that led 
McCormick to support the work of biologist 
Gregory Pincus and gynaecologist John Rock 
on hormones that prevent ovulation. Sanger 
thus lived to see what she had long wished for, 
an effective contraceptive: the pill. 

Population control still comes too low in 
global initiatives. Sanger’s career reminds us 
that many of the world’s problems originate 
because there are simply too many of us. m 


W. E. Bynum is professor emeritus of medical 
history at University College London, UK. 
e-mail: w.bynum@ucl.ac.uk 


The Evolution of Childhood: Relationships, 
Emotion, Mind 

Melvin Konner (Harvard Univ. Press, 2011; $22.50) 
Anthropologist Melvin Konner reveals how our 
childhood affects who we are individually and as 
a species. Morten Kringelbach described Konner 
as “an excellent tour guide to the sacred lands of 
childhood” (Nature 467, 918-919; 2010). 


318 | NATURE | VOL 478 | 20 OCTOBER 2011 


© 2011 Macmillan Publishers Limited. All rights reserved 


Evolution: The First Four Billion Years 
Edited by Michael Ruse and Joseph Travis (Harvard 
Univ. Press, 2011; $24.95) 

This authoritative introduction to evolutionary 
biology delves into the history and controversies 
of the field. It includes an encyclopaedic section 
covering key figures and topics — from Aristotle 
and altruism to E. O. Wilson and sociobiology. 


Pathogenic passengers 


Edward C. Holmes welcomes a robust account of how 
viruses emerge and how pandemics can be prevented. 


andemic disease is a world traveller. 
Px: simple fact underlies much of the 
work of the Global Viral Forecasting 
Initiative (GVFI), an organization established 
and run by microbiologist Nathan Wolfe, 
which aims to predict and prevent disease 
pandemics. So it is apt that I read Wolfe’s The 
Viral Storm ona crowded airliner bound for 
Beijing, and fortunate that the book was suf- 
ficiently diverting to keep me awake for 13 
hours. It is an excellent piece of scientific 
gothic, rich in descriptions of the threat we 
face from emerging viruses and how we might 
prevent them from becoming pandemic. 
The book is part description of viruses 
and how they jump species to emerge in 
new hosts, part autobiography and part 
advertisement for the GVFI. Wolfe discusses 
viral natural history and the history of past 
pandemics, splicing in references to his 


The Emperor of All Maladies 

Siddhartha Mukherjee (Fourth Estate, 2011; £9.99) 
In his ‘biography of cancer’, physician Siddhartha 
Mukherjee provides a new perspective on the 
disease and the people affected by it. While 
showing how modern understanding of cancer 

is leading to new tactics to control it, he also 
explores how humans fought it in the past. 


research trips to the developing world. His 
clear message is that humans are entering a 
new pandemic age, fuelled by such factors as 
the rise in global travel, urbanization, defor- 
estation and changing agricultural practices, 
all of which increase the likelihood of human 
exposure to animal species that carry poten- 
tially deadly viral infections. 

In this connected world, new viruses are 
able to spread quickly, perhaps with dev- 
astating effect. “A storm is brewing,” says 
Wolfe, and science must retool to meet this 
pandemic threat. That will mean increased 
surveillance of potential reservoir species, 
the deployment of new technologies that can 
quickly identify the infectious agent respon- 
sible for a disease outbreak, and the use of 
social-networking media to track cases of 
human infection in real time. 

Wolfe’s arguments are generally 
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convincing, particularly on the science of 
viral surveillance and the part that social 
media might play in this; the next pan- 
demic will doubtless be both televised and 
tweeted. Yet he overgeneralizes at times — for 
example, when stressing the role that hunt- 
ing, by exposing humans to diverse animal 
species, has in viral emergence. Hunting 
may be responsible for the emergence of 
only a small number of human infections — 
although these include HIV, which ignited 
the current interest in emerging viruses and 
may have entered humans through the con- 
sumption of infected ‘bush meat’ Some over- 
exuberance should be 
forgiven, as Wolfe has 
a gift for conveying 
the thrill of scien- 
tific research, writing 
with both gusto and 
panache. 

His most provoca- 
tive claim is that it is 
possible to predict 
which diseases will 
emerge in the future. 
This has been called 


The Viral Storm: 
The Dawn of a New 
Pandemic Age 


NATHAN WOLFE . 

Allen Lane/Times: the holy grail of 
2011. 320 pp. research into infectious 
£14.99/$26 disease, and has > 


Written in Stone: Evolution, the Fossil Record, 
and Our Place in Nature 

Brian Switek (Icon Books, 2011; £12.99) 

Fossil finds are revealing ever more about Earth 
and human history, explains Brian Switek in this 
accessible book. Reviewer Jan Zalasiewicz called 
it “a fine guide to the four-dimensional tapestry 
of life” (Nature 469, 32-33; 2011). 
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> attracted considerable research funding. 
Although ‘prediction’ is an enticing word, 
how it will be done is vaguely described at 
best. In Wolfe’s research programme, predic- 
tion seems to involve fine-scale surveillance 
— for instance, through increasingly high- 
tech surveys of viral biodiversity in potential 
reservoir species, the monitoring of unusual 
animal die-offs, and the minute-by-minute 
screening of Google trends, Twitter and 
mobile-phone data. 

Gathering such data is laudable, but still 
more reactive than predictive. In reality, 
accurate prediction faces challenges that 
could prove insurmountable. A novel virus 
discovered in a potential reservoir species 
may not replicate in human cells; a virus that 
replicates in human cells may not transmit 
between humans; anda variety of epidemio- 
logical processes dictate that even if such a 
virus is able to transmit between hosts, it 
may not spread through a population. Pre- 
dicting viral emergence therefore requires a 
difficult, and perhaps unattainable, synthesis 
of genetics and epidemiology. 

I believe that this fad for prediction 
presents a greater danger: that we establish 
expectations so unrealistic that they are met 
with inevitable failure, in turn undermin- 
ing public confidence. Another unwelcome 
consequence, in my view, is the diversion of 
research funds from basic biological studies 
of viral emergence — such as defining the 
relationship between a virus and its host-cell 
receptor — to more speculative programmes, 
such as predicting future viral evolution. 

Wolfe has become the public face of emerg- 
ing disease. This enjoyable, well researched 
and thought-provoking book shows that he 
has a clear vision of how pandemics occur 
in human populations and the part he 
might play in their prevention and control. 
Although it is not clear what the coming years 
will hold, one safe prediction is that Wolfe will 
have alot more to say. On the evidence of this 
book, he is worth listening to. m 


Edward C. Holmes is a professor in the 
Department of Biology, Pennsylvania State 
University, University Park, Pennsylvania 
16802, USA, and the Fogarty International 
Center, National Institutes of Health, 
Bethesda, Maryland 20892, USA. 

e-mail: echolmes@psu.edu 
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The inspiration 
exchange 


Chris Lintott assesses an idea that unites individual 
scientific expertise with collective intelligence. 


each morning and choosing what to 
work on froma list of problems submit- 
ted by scientists from around the world. The 
requests cut across academic boundaries, 
yet have been selected with you in mind 
as the person best qualified to solve them. 
What would such a system achieve? What 
would your career be like as a consequence? 
This scenario is the vision of writer and 


[e= sitting down at your computer 


quantum-computation expert Michael 
Nielsen. In 2007, he announced on his blog 
that he felt he could have more impact by 
developing new scientific tools than by 
pursuing his physics research. Reinventing 
Discovery, his thought-provoking call to 
arms, suggests he was right. 

Nielsen is convinced that, as a result of 
our growing ability to share information 
and ideas, we are living through a revolution 


The Arsenic Century: How Victorian Britain was 
Poisoned at Home, Work, and Play 

James C. Whorton (Oxford Univ. Press, 2011; £9.99) 
Arsenic in the nineteenth century was ubiquitous 
— stored in kitchens as a rat killer, or used to 

dye sweets, candles and gloves. Historian James 
Whorton shows how its deadly past resonates 
with the environmental poisonings of today. 
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Where Good Ideas Come from: The Seven 
Patterns of Innovation 

Steven Johnson (Penguin, 2011; £9.99) 

Good ideas, says writer Steven Johnson, are 
rarely produced by lone geniuses. Innovation 
more often grows out of a network of minds, he 
argues. Universities offer the best chance for 
breakthroughs as they lack market pressures. 


in how knowledge is 
constructed. He argues 
that, just as the wide- 
spread adoption of the 
scientific method in 
the seventeenth cen- 
tury made discovery 
a realistic aim for any- 
one who could devise 
Reinventing the right experiment, 
Discovery: The so the rise of data- 


New Era of ‘ : : 
Networked Science driven discovery will 


MICHAEL NIELSEN transform the ability of 
Princeton University the average scientist to 
Press: 2011.280pp. contribute. Although 
$24.95, £17.95 there is room for the 


lone genius, the rest of 
us need all the help we can get. 

Researchers should take advantage of the 
emergence of this collective intelligence to 
tackle more difficult problems, Nielsen sug- 
gests. In one chapter (on which I gave feed- 
back), he describes a variety of web-based 
‘citizen science’ and other case studies to 
marshal his points. Some have research as 
their goal, such as my Galaxy Zoo project 
(www.galaxyzoo.org), which enlists the 
public’s help in classifying galaxies, and 
the collaborative mathematics carried out 
by participants in the Polymath projects 
(www.polymathprojects.org). But a strength 
of Nielsen’s book is that many more of the 
examples come from beyond academia. 

For instance, a virtual game of chess in 
1999 pitted the then world champion Garry 
Kasparov against ‘the world; as represented 
by the votes of some 50,000 online par- 
ticipants. It was more keenly fought than 
expected. The game hinged on a critical 
move by ‘the world’ that was proposed by 
US women’s champion Irina Krush. Moving 
the black queen to fork two pawns, Krush 
exploited her knowledge of a previous situ- 
ation shed studied to open up a novel posi- 
tion. She wasn't in Kasparov's league, but she 
had the ability and knowledge to exploit the 
situation in which her team found itself. 

This idea of micro-expertise underlies 
Nielsen’s vision of networked discovery. If 
enough people are exposed toa problem, he 
explains, the odds of one of them having the 
special insight or analytical ability needed 
to solve it increases, and so does the speed 
of discovery. 


Internet Alley: High Technology in Tysons 
Corner, 1945-2005 

Paul E. Ceruzzi (MIT Press, 2011; $14.95) 
Historian Paul Ceruzzi uncovers the story of the 
hub in Virginia from where much of the Internet is 
managed and governed. “Ceruzzi chronicles the 
evolution of Internet Alley astutely and accurately,” 
wrote Joel Shurkin (Nature 452, 533; 2008). 
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The key is that, as with Krush, the right 
person is faced with the right problem at the 
right time. What links the successful case 
studies in the book is that each has a system 
for directing the attention of the multitude 
of participants. Posts on the Polymath blog, 
for example, are structured so that each 
contains only one insight, making it easier 
to direct your contributions. It is this sort 
of systematic focusing of attention that 
moves us beyond the traditional ‘wisdom of 
crowds to a more powerful and subtle col- 
lective intelligence, capable of solving even 
complex problems. 

It is tempting to rail against the lack of 
romance in Nielsen’s collective vision, but a 
division of labour has been under way in sci- 
ence for decades. Many scientists are happy 
to leave the creation of large data sets to those 
who specialize in it: modern astronomers, 
for example, are more likely to write a data- 
base query than to visit a telescope. Last year 
saw the launch of ScienceExchange.com, 
which allows professional labs to sell their 
spare experimental capacity in a market 
tailored to that commodity. An exchange for 
inspiration, or at least for mental capacity, 
may not be far behind. 

Ina plea for an “open science imperative’, 
Nielsen argues that scientists need to do 
more than swap data and code. They need 
to share speculative thoughts — including 
half-finished ideas and open questions. 
Some are already doing this: take Rosie 
Redfield’s open-resource blog chronicling 
her microbiology experiments and bench- 
based musings from her lab at the University 
of British Columbia, Canada (see go.nature. 
com/ae82wp). But it is still a lonely endeav- 
our, and until a substantial number of lead- 
ing researchers take the plunge, it is likely to 
remain so. 

Nielsen ends simply, with the suggestion 
that we all try openness. The lack of a step- 
by-step programme for the revolution comes 
as a disappointment, but it is to his credit 
that he convinces us that radical change is 
a real possibility. Reinventing Discovery will 
frame serious discussion and inspire wild, 
disruptive ideas for the next decade. m 


Chris Lintott is an astrophysicist at the 
University of Oxford, UK. 
e-mail: cjl@astro.ox.ac.uk 


Leonardo’s Legacy: How Da Vinci Reimagined 
the World 

Stefan Klein (Da Capo Press, 2011; $16) 
Leonardo da Vinci designed a working robot, a 
heart valve and weapons. Science writer Stefan 
Klein analyses aspects of the artist’s scientific 
genius, from well-known projects such as his 
flying machine to other, more obscure inventions. 
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Capturing free will 


Jacek Debiec enjoys two complementary books charting 
the psychology and neuroscience of decision-making. 


rong or right, we are constantly 
making choices. Decision-mak- 
ing is not just about choosing 


careers, political representatives or the per- 
son you want to share your life with. Most 
of the time it is deciding where to put your 
foot as you take your next step, when to take 
a bite of your sandwich or whether to scratch 
that itch on your shoulder. Over your life- 
time, you make trillions of decisions, mostly 
remaining unaware of doing so. 

Two books examine different aspects 
of how we make judgements and choices. 


Oliver Sacks (Picador, 2011; £8.99) 


Thinking, Fast and Slow 

DANIE AHNEMA 
Allen Lane/Farrar, Straus and Giroux: 2011. 
512 pp. £25/$30 


Who’s in Charge? Free Will and the Science 
of the Brain 
Ecco: 2011. 272 pp. $27.99, £18.24 

Psychologist Daniel Kahneman, a pioneer 
in the science of decision-making who won 
the 2002 Nobel prize in economics, sees the 
workings of the mind as the interaction of 


two systems, conscious and unconscious, that 
are continuously at play. In Thinking, Fast 
and Slow he shows why his ideas shook the 
existing model of rational decision-making. 
In Whos in Charge?, cognitive neurosci- 
entist Michael Gazzaniga tackles a related 
question: given the range of automatic 
and deliberative factors that underlie our 
choices, why do we feel so in control when 
we make them? 

The books complement each other beau- 
tifully. Kahneman provides a detailed, yet 
accessible, description of the psychologi- 
cal mechanisms involved in making deci- 
sions. Gazzaniga descends into the neural 
circuits that underpin those mechanisms. 
There are instructive contrasts too. Kahne- 
man focuses on practical dimensions of the 
choice ‘machinery; such as the role of atten- 
tion, where biases come from and ways of 
avoiding judgement errors. Gazzaniga, 
meanwhile, attempts to understand inherent 
philosophical issues, such as the roots and 
nature of consciousness and free will. Both 
books deliver an up-to-date review of the 
research in psychology and neuroscience. 

Kahneman proposes a dual scheme for 
decision-making in the brain. System I is 
fast, effortless and intuitive, whereas Sys- 
tem II is more deliberative and logical, and 
consequently slower. Both operate continu- 
ously while we are awake, with System I as 
the default controlling our actions, and Sys- 
tem II monitoring and activating whenever 
System I runs into difficulty. An experienced 
driver, for example, will drive ‘on autopilot’ 
until they reach something unexpected, such 
as a detour. They will then need System II 
to find their way through unfamiliar streets. 

Neither system, Kahneman shows, is 
foolproof on its own. You can suppress your 
intuitions but cannot turn them off at will: 
that is why errors of System I occur. But Sys- 
tem II is too slow to be efficient for making 
routine decisions. 

Discoveries regarding these two systems 
— their motivating factors, limitations, 
operating principles and the interplay 
between them — have had an impact on dis- 
ciplines ranging from psychology and eco- 
nomics to law and philosophy. Kahneman, 
through his work on how life satisfaction 
affects decision-making, has contributed 
to the development of positive psychology 
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A tumour led neurologist Oliver Sacks to lose his 


stereoscopic vision, prompting him to study other 


visual disorders and their effects on people. He 
reveals “the complex roles of sight in human life 
and the constitution of personal identity”, wrote 


Steve Silberman (Nature 467, 1036-1037; 2010). 
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Rebecca M. Jordan-Young (Harvard Univ. Press, 
2011; $19.95) 

Rebecca Jordan-Young reviews the research on 
supposed differences in male and female brains, 
arguing for more rigour. Virginia Valian called it “a 
welcome corrective” (Nature 470, 332-333; 2011). 
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Gaming, guns and virtual gold 


John Gilbey savours a fast-paced technological thriller based at the intersection 
of the real world and the cyber-universe. 


conference in Silicon Valley, California. 

The author was sitting in a melee of 
scientists and technologists, but seemed 
oddly detached. If I had gone closer, I 
suspect I might have seen a flashing red 
‘Record light in the corner of one eye as he 
absorbed his surroundings for future use. 

Superb attention to detail is one defining 
characteristic of Stephenson's latest novel, 
REAMDE. A thousand-page epic, the book 
hinges on the computer virus of the title, 
but offers a panorama of developing cyber- 
economies, online gaming, international 
terrorism and the flow of economic power 
from one culture to another. Stephenson 
frames the story as a series of intersections 
between the lives of techno-folk and the rest 
of the world, building a morass of foul-ups 
and misunderstandings that rings true. 

The novel centres on 50-something 
loner Richard Forthrast, head of a 
successful online-gaming company 
called Corporation 9592. Already 
rich from a youth misspent back- 
packing drugs into the United 
States from Canada, Forthrast is 
an aficionado of online gaming — a 
passion that has led him to create T’Rain, 
now the dominant online role-play game of 
the age. 

The REAMDE virus is designed to trap 
users of T’Rain by professional — if naive 
— hackers based in China. Once it infects 
a computer, the virus (with a nod to real 
examples) encrypts stored documents and 
demands a ransom. REAMDE adds a twist. 
The ransom must be paid in virtual gold 
within the game, from where it is collected 
by the virus writers in a form of ‘gold-farm- 
ing’ — a means of working within games to 
convert virtual assets into actual cash value. 

The scale of the infection disturbs the 
balance of the game: once word of huge 


I almost met Neal Stephenson once, at a 


caches of gold gets out, every bandit and 
warlord of the virtual world seeks to make 
it his or her own — by force. 

The degree of action in this multi-threaded 
plot puts most Hollywood blockbusters in 
the shade. Stephenson's narrative moves 
seamlessly between real and virtual worlds 
with impressive pace, and his descriptions 
of technology are accurate and believ- 
able. Aside from a couple of non-catalogue 
components introduced by an MI6 agent, 
there are no ‘magic boxes here. The novel is 
peopled with well-drawn characters such 
as Zula, Forthrast’s software-engineer niece 

whose abduction sets the plot 


running, and Csongor, a Hungarian security 
specialist forced to survive on his wits. Add 
dangerously intelligent terrorists, darkly sinis- 
ter state-security forces, grimy travel alternat- 
ing with the height of luxury, and you find a 
novel in which a largely sympathetic cast of 


characters is thrown 
in at the deep end of an 
extreme environment. 

Unlike much of 
Stephenson’s writ- 
ing, REAMDE is set 
in the present. But it 
shares many aspects 
of his previous work, 
such as the highly 
regarded future-based 
novel Snow Crash: a 
strong-willed young 
heroine, dramatic and 
cinematic action, and 
lots of technology. His 
writing style has developed markedly since 
his early novels, in which gleeful descriptions 
of violent mayhem occasionally verged on 
the cartoonish. The opening sequence of 
REAMDE, for example, describes a 

fraught family reunion in rural Iowa 

— with various family members 
and hangers-on seeking approval 
for their shooting skills. The group's 
strong shared values and equally deep 
divisions come across powerfully in a 
portrayal that is reminiscent of Annie Proulx 
or, to a degree, Garrison Keillor. 

There is nothing routine about this 
masterly techno-thriller. REAMDE mixes 
elements of current concern in technology 
and in society. It makes clear how far we 
have moved towards a single global com- 
munications, entertainment and business 
environment, while still being bound by 
our own local cultural heritage. Stephenson 
demonstrates just how explosive that com- 
bination can be. m 


REAMDE 

NEAL STEPHENSON 
Atlantic Books/William 
Morrow: 2011. 

912 pp./1056 pp. 
£18.99/$35 


John Gilbey is in the Department of 
Computer Science, Aberystwyth University, 
Aberystwyth SY23 3DB, UK. 

e-mail: gilbey@bcs.org.uk 


The Moral Landscape: How Science Can — 
Determine Human Values 

Sam Harris (Free Press, 2011; $15) 

Journalist Sam Harris argues that science can help 
us to understand moral values. Harris “eloquently 
counters the jaded pessimists who think that 
science has little to say about happiness”, wrote 
reviewer Pascal Boyer (Nature 469, 297; 2011). 


Chasing the Sun: The Epic Story of the Star 
That Gives Us Life 

Richard Cohen (Simon & Schuster, 2011; £8.99) 
Richard Cohen examines why our star is studied 
and even worshipped. Douglas Gough noted that 
the book “paints a fascinating and far-reaching 
scene that incorporates nearly all aspects of solar 
phenomena” (Nature 468, 504-506; 2010). 
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ex and freedom 


W. F. Bynum applauds the life of a great educator who 


fought for parental choice. 


ampaigner and educator Margaret 
( Sanger was passionate: about life, her 

cause — the right of women to limit the 
size of their family — and her many lovers. 
She lived her long life in the fast lane, dying in 
1966 aged 87. Historian Jean Baker’s account 
gives all that verve its due. 

Sanger’s rise to international prominence 
came against the odds. Born to a large Irish 
Catholic working-class family in upstate New 
York, she had neither educational nor cultural 
advantages. Her mother died of tuberculosis, 
and Sanger, too, had the disease, although 
apparently in her tonsils, not her lungs. Med- 
ical school was beyond her means, so she 
trained as a nurse but never completed the 
course. Neither of her two marriages was con- 
ventional: she hated domesticity and was fre- 
quently away from home. Her three children 
were often cared for by others, although she 
did return home to hold her dying daughter. 

Sanger devoted her life to the cause for 
which she is still remembered by many, and 
vilified by some: birth control. She grew up 
knowing at first hand what the rigours of fre- 
quent childbearing with little income took 
out of women, and saw many more examples 
in her brief career as a district nurse in New 
York City before the First World War. 

She and her first husband, architect Wil- 
liam Sanger, moved in bohemian, socialist 
and even anarchist circles. Women’s right to 
vote was the most pressing issue among femi- 
nists of the time, although many of Sanger’s 
radical friends set suffragist struggles within 
the larger social and political context. In her 
world view, all these movements paled before 
the central one: the liberation that separating 
sexuality from repro- 


duction could give to OD NATURE.COM 

women in particular, Michael Sargent on 

and society in general. _ population control: 
Sanger faced  go.nature.com/tvesex 


formidable problems. 
Contraception was not 
only taboo, it was ille- 
gal. Under-the-counter 
methods — foams, 
caps, pessaries and 
douches — were unre- 
liable and cumber- 
some, although more 
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effective than nothing. beg ape ael abal 
The condom, which JEAN H. BAKER 


was much improved pil! & Wang: 2011. 
during Sanger’s early 368 pp. $35 
career, was officially 
used only to prevent venereal disease. Absti- 
nence was hardly an easy option, and the 
‘rhythm method; which Sanger distrusted, 
was hit-and-miss. Knowledge of the female 
reproductive cycle was still rudimentary. 
Despite these difficulties, Sanger believed 
that women deserved help in limiting their 
family sizes. From a small campaign to open 
aclinic to the international organizations that 
she helped to create, the legality and availabil- 
ity of birth control were her abiding passions. 
This brought her into confrontation with 
the law (including time in jail), the Catholic 
Church and politicians. She also had frequent 
contact with physicians, and the peculiarities 
of US society in the early twentieth century 
meant that she needed them to run her clinics. 
Physicians were exempted from some 
of the strictures of the punitive Comstock 
laws, named after Anthony Comstock 
(1844-1915), an ardent puritan who per- 
suaded Congress to pass draconian prohibi- 
tions on all manner of things that he deemed 
obscene. Comstock clashed with Sanger and 
many others over contraceptive devices, 
which fell within his remit as a postal 
inspector; it was an offence to send anything 
obscene through the postal service. Com- 
stock went on to inspire the youthful J. Edgar 


Hoover, ensuring that the FBI continued the 
vigilance. So the gaps in the Comstock laws, 
and Sanger’s own pragmatic attitude to her 
cause, meant that birth control was medical- 
ized almost from the beginning. 

Baker elegantly recounts Sanger’s single- 
minded pursuit of her obsession. Her account 
reminds us that there has long been a litigious 
element in US society. Substitute birth control 
with the terms stem-cell research or abortion, 
and a lot of what Baker says is uncomfortably 
topical. To her credit, she never preaches, even 
if her own modernist liberal stance is clear. 
Instead, she reveals a portrait of a complex 
woman whose heart was in the right place. 

By sheer dint of will, Sanger became a 
woman of international standing and an 
inspiration to both sexes. Her vision of birth 
control became mixed with eugenic con- 
cerns — common among many reformers of 
her period — but she never lost her abiding 
concern for the plight of individual women. 

One could write a psychobiography of 
Margaret Sanger. She was often called a hys- 
teric; her granddaughter described her as a 
nymphomaniac. Her love life, involving the 
writer H. G. Wells and several of her lawyers 
and assistants, was colourful and crowded. 
Yet such a portrait would miss the point of 
Sanger’s achievement, and Baker is wise to 
rely on the evidence to reveal her subject. 

Sanger had an attractive personality and 
many friends. This was fortunate: she needed 
the rich to support her and her cause. Among 
her many rich allies was Katharine McCor- 
mick, a biology graduate of the Massachu- 
setts Institute of Technology in Cambridge, 
whose husband had a serious mental disorder, 
probably schizophrenia. McCormick devoted 
much of their fortune to medical research, 
and it was her friendship with Sanger that led 
McCormick to support the work of biologist 
Gregory Pincus and gynaecologist John Rock 
on hormones that prevent ovulation. Sanger 
thus lived to see what she had long wished for, 
an effective contraceptive: the pill. 

Population control still comes too low in 
global initiatives. Sanger’s career reminds us 
that many of the world’s problems originate 
because there are simply too many of us. m 


W. E. Bynum is professor emeritus of medical 
history at University College London, UK. 
e-mail: w.bynum@ucl.ac.uk 


The Evolution of Childhood: Relationships, 
Emotion, Mind 

Melvin Konner (Harvard Univ. Press, 2011; $22.50) 
Anthropologist Melvin Konner reveals how our 
childhood affects who we are individually and as 
a species. Morten Kringelbach described Konner 
as “an excellent tour guide to the sacred lands of 
childhood” (Nature 467, 918-919; 2010). 
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Evolution: The First Four Billion Years 
Edited by Michael Ruse and Joseph Travis (Harvard 
Univ. Press, 2011; $24.95) 

This authoritative introduction to evolutionary 
biology delves into the history and controversies 
of the field. It includes an encyclopaedic section 
covering key figures and topics — from Aristotle 
and altruism to E. O. Wilson and sociobiology. 


Pathogenic passengers 


Edward C. Holmes welcomes a robust account of how 
viruses emerge and how pandemics can be prevented. 


andemic disease is a world traveller. 
Px: simple fact underlies much of the 
work of the Global Viral Forecasting 
Initiative (GVFI), an organization established 
and run by microbiologist Nathan Wolfe, 
which aims to predict and prevent disease 
pandemics. So it is apt that I read Wolfe’s The 
Viral Storm ona crowded airliner bound for 
Beijing, and fortunate that the book was suf- 
ficiently diverting to keep me awake for 13 
hours. It is an excellent piece of scientific 
gothic, rich in descriptions of the threat we 
face from emerging viruses and how we might 
prevent them from becoming pandemic. 
The book is part description of viruses 
and how they jump species to emerge in 
new hosts, part autobiography and part 
advertisement for the GVFI. Wolfe discusses 
viral natural history and the history of past 
pandemics, splicing in references to his 


The Emperor of All Maladies 

Siddhartha Mukherjee (Fourth Estate, 2011; £9.99) 
In his ‘biography of cancer’, physician Siddhartha 
Mukherjee provides a new perspective on the 
disease and the people affected by it. While 
showing how modern understanding of cancer 

is leading to new tactics to control it, he also 
explores how humans fought it in the past. 


research trips to the developing world. His 
clear message is that humans are entering a 
new pandemic age, fuelled by such factors as 
the rise in global travel, urbanization, defor- 
estation and changing agricultural practices, 
all of which increase the likelihood of human 
exposure to animal species that carry poten- 
tially deadly viral infections. 

In this connected world, new viruses are 
able to spread quickly, perhaps with dev- 
astating effect. “A storm is brewing,” says 
Wolfe, and science must retool to meet this 
pandemic threat. That will mean increased 
surveillance of potential reservoir species, 
the deployment of new technologies that can 
quickly identify the infectious agent respon- 
sible for a disease outbreak, and the use of 
social-networking media to track cases of 
human infection in real time. 

Wolfe’s arguments are generally 
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convincing, particularly on the science of 
viral surveillance and the part that social 
media might play in this; the next pan- 
demic will doubtless be both televised and 
tweeted. Yet he overgeneralizes at times — for 
example, when stressing the role that hunt- 
ing, by exposing humans to diverse animal 
species, has in viral emergence. Hunting 
may be responsible for the emergence of 
only a small number of human infections — 
although these include HIV, which ignited 
the current interest in emerging viruses and 
may have entered humans through the con- 
sumption of infected ‘bush meat’ Some over- 
exuberance should be 
forgiven, as Wolfe has 
a gift for conveying 
the thrill of scien- 
tific research, writing 
with both gusto and 
panache. 

His most provoca- 
tive claim is that it is 
possible to predict 
which diseases will 
emerge in the future. 
This has been called 


The Viral Storm: 
The Dawn of a New 
Pandemic Age 


NATHAN WOLFE . 

Allen Lane/Times: the holy grail of 
2011. 320 pp. research into infectious 
£14.99/$26 disease, and has > 


Written in Stone: Evolution, the Fossil Record, 
and Our Place in Nature 

Brian Switek (Icon Books, 2011; £12.99) 

Fossil finds are revealing ever more about Earth 
and human history, explains Brian Switek in this 
accessible book. Reviewer Jan Zalasiewicz called 
it “a fine guide to the four-dimensional tapestry 
of life” (Nature 469, 32-33; 2011). 
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> attracted considerable research funding. 
Although ‘prediction’ is an enticing word, 
how it will be done is vaguely described at 
best. In Wolfe’s research programme, predic- 
tion seems to involve fine-scale surveillance 
— for instance, through increasingly high- 
tech surveys of viral biodiversity in potential 
reservoir species, the monitoring of unusual 
animal die-offs, and the minute-by-minute 
screening of Google trends, Twitter and 
mobile-phone data. 

Gathering such data is laudable, but still 
more reactive than predictive. In reality, 
accurate prediction faces challenges that 
could prove insurmountable. A novel virus 
discovered in a potential reservoir species 
may not replicate in human cells; a virus that 
replicates in human cells may not transmit 
between humans; anda variety of epidemio- 
logical processes dictate that even if such a 
virus is able to transmit between hosts, it 
may not spread through a population. Pre- 
dicting viral emergence therefore requires a 
difficult, and perhaps unattainable, synthesis 
of genetics and epidemiology. 

I believe that this fad for prediction 
presents a greater danger: that we establish 
expectations so unrealistic that they are met 
with inevitable failure, in turn undermin- 
ing public confidence. Another unwelcome 
consequence, in my view, is the diversion of 
research funds from basic biological studies 
of viral emergence — such as defining the 
relationship between a virus and its host-cell 
receptor — to more speculative programmes, 
such as predicting future viral evolution. 

Wolfe has become the public face of emerg- 
ing disease. This enjoyable, well researched 
and thought-provoking book shows that he 
has a clear vision of how pandemics occur 
in human populations and the part he 
might play in their prevention and control. 
Although it is not clear what the coming years 
will hold, one safe prediction is that Wolfe will 
have alot more to say. On the evidence of this 
book, he is worth listening to. m 


Edward C. Holmes is a professor in the 
Department of Biology, Pennsylvania State 
University, University Park, Pennsylvania 
16802, USA, and the Fogarty International 
Center, National Institutes of Health, 
Bethesda, Maryland 20892, USA. 

e-mail: echolmes@psu.edu 
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The inspiration 
exchange 


Chris Lintott assesses an idea that unites individual 
scientific expertise with collective intelligence. 


each morning and choosing what to 
work on froma list of problems submit- 
ted by scientists from around the world. The 
requests cut across academic boundaries, 
yet have been selected with you in mind 
as the person best qualified to solve them. 
What would such a system achieve? What 
would your career be like as a consequence? 
This scenario is the vision of writer and 


[e= sitting down at your computer 


quantum-computation expert Michael 
Nielsen. In 2007, he announced on his blog 
that he felt he could have more impact by 
developing new scientific tools than by 
pursuing his physics research. Reinventing 
Discovery, his thought-provoking call to 
arms, suggests he was right. 

Nielsen is convinced that, as a result of 
our growing ability to share information 
and ideas, we are living through a revolution 


The Arsenic Century: How Victorian Britain was 
Poisoned at Home, Work, and Play 

James C. Whorton (Oxford Univ. Press, 2011; £9.99) 
Arsenic in the nineteenth century was ubiquitous 
— stored in kitchens as a rat killer, or used to 

dye sweets, candles and gloves. Historian James 
Whorton shows how its deadly past resonates 
with the environmental poisonings of today. 
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Where Good Ideas Come from: The Seven 
Patterns of Innovation 

Steven Johnson (Penguin, 2011; £9.99) 

Good ideas, says writer Steven Johnson, are 
rarely produced by lone geniuses. Innovation 
more often grows out of a network of minds, he 
argues. Universities offer the best chance for 
breakthroughs as they lack market pressures. 
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how it will be done is vaguely described at 
best. In Wolfe’s research programme, predic- 
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— for instance, through increasingly high- 
tech surveys of viral biodiversity in potential 
reservoir species, the monitoring of unusual 
animal die-offs, and the minute-by-minute 
screening of Google trends, Twitter and 
mobile-phone data. 

Gathering such data is laudable, but still 
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discovered in a potential reservoir species 
may not replicate in human cells; a virus that 
replicates in human cells may not transmit 
between humans; anda variety of epidemio- 
logical processes dictate that even if such a 
virus is able to transmit between hosts, it 
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dicting viral emergence therefore requires a 
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of genetics and epidemiology. 

I believe that this fad for prediction 
presents a greater danger: that we establish 
expectations so unrealistic that they are met 
with inevitable failure, in turn undermin- 
ing public confidence. Another unwelcome 
consequence, in my view, is the diversion of 
research funds from basic biological studies 
of viral emergence — such as defining the 
relationship between a virus and its host-cell 
receptor — to more speculative programmes, 
such as predicting future viral evolution. 

Wolfe has become the public face of emerg- 
ing disease. This enjoyable, well researched 
and thought-provoking book shows that he 
has a clear vision of how pandemics occur 
in human populations and the part he 
might play in their prevention and control. 
Although it is not clear what the coming years 
will hold, one safe prediction is that Wolfe will 
have alot more to say. On the evidence of this 
book, he is worth listening to. m 
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Nielsen. In 2007, he announced on his blog 
that he felt he could have more impact by 
developing new scientific tools than by 
pursuing his physics research. Reinventing 
Discovery, his thought-provoking call to 
arms, suggests he was right. 

Nielsen is convinced that, as a result of 
our growing ability to share information 
and ideas, we are living through a revolution 
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James C. Whorton (Oxford Univ. Press, 2011; £9.99) 
Arsenic in the nineteenth century was ubiquitous 
— stored in kitchens as a rat killer, or used to 
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Whorton shows how its deadly past resonates 
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Where Good Ideas Come from: The Seven 
Patterns of Innovation 

Steven Johnson (Penguin, 2011; £9.99) 

Good ideas, says writer Steven Johnson, are 
rarely produced by lone geniuses. Innovation 
more often grows out of a network of minds, he 
argues. Universities offer the best chance for 
breakthroughs as they lack market pressures. 


in how knowledge is 
constructed. He argues 
that, just as the wide- 
spread adoption of the 
scientific method in 
the seventeenth cen- 
tury made discovery 
a realistic aim for any- 
one who could devise 
Reinventing the right experiment, 
Discovery: The so the rise of data- 


New Era of ‘ : : 
Networked Science driven discovery will 


MICHAEL NIELSEN transform the ability of 
Princeton University the average scientist to 
Press: 2011.280pp. contribute. Although 
$24.95, £17.95 there is room for the 


lone genius, the rest of 
us need all the help we can get. 

Researchers should take advantage of the 
emergence of this collective intelligence to 
tackle more difficult problems, Nielsen sug- 
gests. In one chapter (on which I gave feed- 
back), he describes a variety of web-based 
‘citizen science’ and other case studies to 
marshal his points. Some have research as 
their goal, such as my Galaxy Zoo project 
(www.galaxyzoo.org), which enlists the 
public’s help in classifying galaxies, and 
the collaborative mathematics carried out 
by participants in the Polymath projects 
(www.polymathprojects.org). But a strength 
of Nielsen’s book is that many more of the 
examples come from beyond academia. 

For instance, a virtual game of chess in 
1999 pitted the then world champion Garry 
Kasparov against ‘the world; as represented 
by the votes of some 50,000 online par- 
ticipants. It was more keenly fought than 
expected. The game hinged on a critical 
move by ‘the world’ that was proposed by 
US women’s champion Irina Krush. Moving 
the black queen to fork two pawns, Krush 
exploited her knowledge of a previous situ- 
ation shed studied to open up a novel posi- 
tion. She wasn't in Kasparov's league, but she 
had the ability and knowledge to exploit the 
situation in which her team found itself. 

This idea of micro-expertise underlies 
Nielsen’s vision of networked discovery. If 
enough people are exposed toa problem, he 
explains, the odds of one of them having the 
special insight or analytical ability needed 
to solve it increases, and so does the speed 
of discovery. 


Internet Alley: High Technology in Tysons 
Corner, 1945-2005 

Paul E. Ceruzzi (MIT Press, 2011; $14.95) 
Historian Paul Ceruzzi uncovers the story of the 
hub in Virginia from where much of the Internet is 
managed and governed. “Ceruzzi chronicles the 
evolution of Internet Alley astutely and accurately,” 
wrote Joel Shurkin (Nature 452, 533; 2008). 


BOOKS & ARTS | COMMENT | 


The key is that, as with Krush, the right 
person is faced with the right problem at the 
right time. What links the successful case 
studies in the book is that each has a system 
for directing the attention of the multitude 
of participants. Posts on the Polymath blog, 
for example, are structured so that each 
contains only one insight, making it easier 
to direct your contributions. It is this sort 
of systematic focusing of attention that 
moves us beyond the traditional ‘wisdom of 
crowds to a more powerful and subtle col- 
lective intelligence, capable of solving even 
complex problems. 

It is tempting to rail against the lack of 
romance in Nielsen’s collective vision, but a 
division of labour has been under way in sci- 
ence for decades. Many scientists are happy 
to leave the creation of large data sets to those 
who specialize in it: modern astronomers, 
for example, are more likely to write a data- 
base query than to visit a telescope. Last year 
saw the launch of ScienceExchange.com, 
which allows professional labs to sell their 
spare experimental capacity in a market 
tailored to that commodity. An exchange for 
inspiration, or at least for mental capacity, 
may not be far behind. 

Ina plea for an “open science imperative’, 
Nielsen argues that scientists need to do 
more than swap data and code. They need 
to share speculative thoughts — including 
half-finished ideas and open questions. 
Some are already doing this: take Rosie 
Redfield’s open-resource blog chronicling 
her microbiology experiments and bench- 
based musings from her lab at the University 
of British Columbia, Canada (see go.nature. 
com/ae82wp). But it is still a lonely endeav- 
our, and until a substantial number of lead- 
ing researchers take the plunge, it is likely to 
remain so. 

Nielsen ends simply, with the suggestion 
that we all try openness. The lack of a step- 
by-step programme for the revolution comes 
as a disappointment, but it is to his credit 
that he convinces us that radical change is 
a real possibility. Reinventing Discovery will 
frame serious discussion and inspire wild, 
disruptive ideas for the next decade. m 


Chris Lintott is an astrophysicist at the 
University of Oxford, UK. 
e-mail: cjl@astro.ox.ac.uk 


Leonardo’s Legacy: How Da Vinci Reimagined 
the World 

Stefan Klein (Da Capo Press, 2011; $16) 
Leonardo da Vinci designed a working robot, a 
heart valve and weapons. Science writer Stefan 
Klein analyses aspects of the artist’s scientific 
genius, from well-known projects such as his 
flying machine to other, more obscure inventions. 
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Capturing free will 


Jacek Debiec enjoys two complementary books charting 
the psychology and neuroscience of decision-making. 


rong or right, we are constantly 
making choices. Decision-mak- 
ing is not just about choosing 


careers, political representatives or the per- 
son you want to share your life with. Most 
of the time it is deciding where to put your 
foot as you take your next step, when to take 
a bite of your sandwich or whether to scratch 
that itch on your shoulder. Over your life- 
time, you make trillions of decisions, mostly 
remaining unaware of doing so. 

Two books examine different aspects 
of how we make judgements and choices. 


Oliver Sacks (Picador, 2011; £8.99) 


Thinking, Fast and Slow 

DANIE AHNEMA 
Allen Lane/Farrar, Straus and Giroux: 2011. 
512 pp. £25/$30 


Who’s in Charge? Free Will and the Science 
of the Brain 
Ecco: 2011. 272 pp. $27.99, £18.24 

Psychologist Daniel Kahneman, a pioneer 
in the science of decision-making who won 
the 2002 Nobel prize in economics, sees the 
workings of the mind as the interaction of 


two systems, conscious and unconscious, that 
are continuously at play. In Thinking, Fast 
and Slow he shows why his ideas shook the 
existing model of rational decision-making. 
In Whos in Charge?, cognitive neurosci- 
entist Michael Gazzaniga tackles a related 
question: given the range of automatic 
and deliberative factors that underlie our 
choices, why do we feel so in control when 
we make them? 

The books complement each other beau- 
tifully. Kahneman provides a detailed, yet 
accessible, description of the psychologi- 
cal mechanisms involved in making deci- 
sions. Gazzaniga descends into the neural 
circuits that underpin those mechanisms. 
There are instructive contrasts too. Kahne- 
man focuses on practical dimensions of the 
choice ‘machinery; such as the role of atten- 
tion, where biases come from and ways of 
avoiding judgement errors. Gazzaniga, 
meanwhile, attempts to understand inherent 
philosophical issues, such as the roots and 
nature of consciousness and free will. Both 
books deliver an up-to-date review of the 
research in psychology and neuroscience. 

Kahneman proposes a dual scheme for 
decision-making in the brain. System I is 
fast, effortless and intuitive, whereas Sys- 
tem II is more deliberative and logical, and 
consequently slower. Both operate continu- 
ously while we are awake, with System I as 
the default controlling our actions, and Sys- 
tem II monitoring and activating whenever 
System I runs into difficulty. An experienced 
driver, for example, will drive ‘on autopilot’ 
until they reach something unexpected, such 
as a detour. They will then need System II 
to find their way through unfamiliar streets. 

Neither system, Kahneman shows, is 
foolproof on its own. You can suppress your 
intuitions but cannot turn them off at will: 
that is why errors of System I occur. But Sys- 
tem II is too slow to be efficient for making 
routine decisions. 

Discoveries regarding these two systems 
— their motivating factors, limitations, 
operating principles and the interplay 
between them — have had an impact on dis- 
ciplines ranging from psychology and eco- 
nomics to law and philosophy. Kahneman, 
through his work on how life satisfaction 
affects decision-making, has contributed 
to the development of positive psychology 
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A tumour led neurologist Oliver Sacks to lose his 


stereoscopic vision, prompting him to study other 


visual disorders and their effects on people. He 
reveals “the complex roles of sight in human life 
and the constitution of personal identity”, wrote 


Steve Silberman (Nature 467, 1036-1037; 2010). 
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Rebecca M. Jordan-Young (Harvard Univ. Press, 
2011; $19.95) 

Rebecca Jordan-Young reviews the research on 
supposed differences in male and female brains, 
arguing for more rigour. Virginia Valian called it “a 
welcome corrective” (Nature 470, 332-333; 2011). 


as a subfield. In law, descriptions of the 
origins of biases help us to understand the 
limitations of witness testimonies. 
Gazzaniga approaches the neural mecha- 
nism of decision-making by looking at the 
big picture. There is no central commander 
in our brain, he argues. Instead, 
there are millions of serially wired 
local circuits, diversified and dis- 
tributed across the brain, all run- 
ning together and making relevant 
decisions. Despite this wiring, we 
feel unified when making those 
decisions, and Gazzaniga looks 
at why. 
He gleans some insights from 
studies of ‘split-brain’ patients, 
who have had surgery to halt their 
intractable epileptic seizures by severing 
the corpus callosum that connects the brain 
hemispheres. Each half of their brain works 
in isolation and can be studied as such. 
Gazzaniga and his colleagues found that 
when the language centres of a person’s dom- 
inant hemisphere — usually the left — are 
disconnected from 
the other hemisphere 
and the information 
it processes, that per- 
son will still explain 
actions triggered by 


> NATURE.COM 
Philip Ball muses on 
Daniel Kahneman’s 
happiness studies: 
go.nature.com/Igsswh 


the non-dominant hemisphere with the 
most probable answer. For example, when a 
picture of a bell was flashed to one subject’s 
non-dominant hemisphere and the word 
‘music was shown to the dominant one, 
the person — asked to select an image best 


BELIEVING IN FREE WILL COULD 


BOLSTER 


A PERSON’S CONTROL OF THEIR 


AUTOMATIC IMPULSE. 


depicting music — pointed to one of a bell. 
He explained his choice by saying that he had 
last heard the sound of a bell. He had not, but 
he made up a story that seemed probable. 
Our dominant hemisphere is hard-wired to 
function as an interpreter, Gazzaniga says — 
it is trying to make sense of our actions given 
the available information. If information is 
missing, as in the above example, the inter- 
preter will make up probable answers. It 
searches for patterns and attributes causes 
after an event occurs. It also responds to 
input about our actions, emotions and 
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thoughts, and links them together to build 
a narrative. Over time, these accumulated 
impressions give rise to a feeling of agency 
and an impression of free will. 
That impression could affect decision- 
making in socially important ways. People 
who were asked to read inspiring 
passages about free will versus 
deterministic depictions ofhuman 
actions were found to cheat less 
when tested afterwards. So, Gaz- 
zaniga notes, believing in free will 
could bolster a person's control of 
their automatic impulse — in this 
example, causing them to act less 
selfishly. The mind, as a property 
of the brain, can regulate some of 
the brain’s functions, just as soft- 
ware defines how hardware is used. 
Kahneman and Gazzaniga take us on a 
journey into the machinery of human deci- 
sion-making, its constrictions and flaws. By 
studying how we make choices, we can learn 
to make better ones. = 


Jacek Debiec is a psychiatrist at the 
New York University Child Study Center, 
New York, USA, and a researcher at the 
Copernicus Center for Interdisciplinary 
Studies, Krakow, Poland. 

e-mail: jacek.debiec@nyu.edu 
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Rid Russian science 
of corruption 


Your claim that “Russian science 
is recovering” is not strictly true 
(Nature 477, 5; 2011). Although 
there has been a large influx of 
money into research recently, 

its distribution seems in many 
cases to have been overshadowed 
by allegations of corruption and 
power play. 

Six mega-grant projects in 
high-energy physics, announced 
this summer by Prime Minister 
Vladimir Putin, were never 
subject to open scientific debate. 
Most of them will be managed 
by the Kurchatov Institute in 
Moscow, whose director, Mikhail 
Kovalchuk, belongs to Putin's 
‘inner circle (Nature 453, 702- 
703; 2008). The Scientific Cadres 
federal programme, which is 
managed by the Ministry of 
Science and Education and is 
formally a competition, has 
deteriorated into a distribution 
of contracts in narrowly defined 
areas, with some winners 
apparently being pre-selected. 

All of this eats into the core 
funding for basic science. 

The budget of the Russian 
Foundation for Basic Research 
(RFBR), the only funding body 
with a grant system similar to 
the Western model, was cut 
sharply last year and will not 
increase until at least 2014. The 
average RFBR grant is about 
US$12,000 a year, and even that 
is in danger — the foundation's 
council is aiming to increase the 
number and size of ‘innovative’ 
technological projects in only a 
few areas. 

The bureaucratic problems 
you mention make life difficult 
for mega-grant holders, but at 
least they can communicate 
directly with science minister 
Andrei Fursenko and even 
President Dmitry Medvedev. 
For the less fortunate majority 
of Russian scientists, these 
problems make doing science, 
especially experimental science, 
almost impossible. 

The international community 
helped Russian science in the 


1990s, when collaboration and 
international grants offered 

a lifeline for many Russian 
groups. Now we need the voices 
of our colleagues around the 
world to press Russias scientific 
leadership into accepting 
international norms in funding 
and decision-making, and to 
clear Russian science of any 
cronyism and corruption. 
Mikhail Gelfand 

A. A. Kharkevich Institute for 
Information Transmission 
Problems, Russian Academy of 
Sciences, Moscow, Russia. 
gelfand@iitp.ru 


Italian quake: critics’ 
logic is questionable 


The manslaughter case against 
Italian scientists for inadequately 
warning local residents before 
the April 2009 earthquake in 
L'Aquila, Italy (Nature 477, 
264-269; 2011), is not justified. 

At the heart of this case lies 
one fact — the danger to L Aquila 
residents at the time of the 
earthquake. More precisely, what 
was the chance that a particular 
individual would be killed in 
the subsequent 24 hours, given 
the frequency of low-magnitude 
tremors around that time and 
the best available science? That 
probability, even in L’Aquila's 
weakest class of building, was 
estimated at less than 1 in 
100,000 on the night of the 
earthquake (T. van Stiphout et al. 
Geophys. Res. Lett. 37, L06306; 
2010). The occurrence of many 
little earthquakes does not make 
the chance of a big earthquake 
very high. 

Conventional wisdom suggests 
that roughly US$1 million needs 
to be spent on public-safety 
measures for each life that would 
be saved. It might have been 
wise to improve L’Aquila’s at-risk 
buildings, particularly as it has 
been known for decades that 
many of them are too weak to 
withstand earthquakes. 

But the hazard level in L’Aquila 
in the days before the earthquake 
was insufficient, by two to three 
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orders of magnitude, to justify 
evacuation of even the weakest 
buildings. The scientists were 
right: sitting tight was a good 
recommendation and, in view 
of the low risk, all the published 
quotes from the seismologists 
were accurate. 

The most troubling aspect 
is the complete absence of a 
quantitative assessment of risk 
among the people seeking to 
condemn the scientists. To critics, 
it makes no difference whether 
the likelihood of a magnitude-6.3 
earthquake was tiny. 

The critics argument that 
a sterner warning should 
have been broadcast is based 
only on the fact that the event 
occurred, and so should have 
been foreseen. This logic seems 
to lie midway between Monty 
Python and Franz Kafka, 
and is terrifying to me as the 
seismologist responsible for 
monitoring the US Pacific 
Northwest, an area inhabitated 
by 10 million people. 
John E. Vidale University of 
Washington, Seattle, USA. 
vidale@uw.edu 


Italian quake: 
science rides politics 


As you report, the case against 
Italy’s National Commission 

for Forecasting and Predicting 
Great Risks hinges not on 

its inability to forecast the 
magnitude-6.3 earthquake that 
struck L’Aquila in April 2009, but 
on its provision of “incomplete, 
imprecise and contradicting 
information’ (Nature 477, 264- 
269; 2011). The case highlights 
a central problem in Italy — the 
incestuous relationship between 
science and politics. 

To some, the commission 
seemed more interested in 
pacifying the local population 
than in giving clear advice about 
earthquake preparedness. 

When the American 
Geophysical Union and the 
American Association for the 
Advancement of Science issued 
statements in support of the 


indicted Italian scientists, I 
pointed out to these organizations 
that the international scientific 
community should be made 
aware of the real content of the 
indictment made by attorney 
Fabio Picuti before they sent 
statements of support to President 
Giorgio Napolitano. Otherwise, 

it would appear that the scientific 
community was trying to 
influence the Italian president on 
the basis of a false premise. 

In Italy, a conflict of interest 
between some scientists and 
politics is the norm. In too many 
cases, scientists prefer to go 
along with, and even to defend, 
political decisions, rather than to 
act independently. Examples of 
this include the building of the 
largest civil hospital in southern 
Italy on the slope of Mount 
Vesuvius (G. Rolandi J. Volcanol. 
Geotherm. Res. 189, 347-362; 
2010), and plans to dispose 
of radioactive waste near the 
southern town of Scanzano 
Jonico — a seismic area near oil 
and gas fields, where the water 
table is close to the surface (B. De 
Vivo Geologia dell’Ambiente 
Anno XII 2, 1-8; 2004). Both 
cases proceeded with the support 
or silence of most of the Italian 
scientific community. There are 
many other such cases. 

I strongly believe that 
scientists should clearly explain 
the limits of their knowledge to 
the public. They must provide 
leadership and not patronize. 
The most important goal has to 
be the protection of human life. 
Benedetto De Vivo University of 
Naples Federico II, Naples, Italy. 
bdevivo@unina.it 
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Rudolf L. Méssbauer 


(1929-2011) 


A physicist who revitalized German science by creating a new type of spectroscopy. 


hen Rudolf Méssbauer found 
in 1957 that y-rays emitted by 
iridium-191 could be absorbed 


by a target of the same isotope without any 
loss of energy, it was soon obvious that he 
had discovered a new basis for spectroscopy. 
Just four years later, at the age of 32, he was 
awarded the Nobel Prize in Physics. 

Now named after him, Méssbauer 
spectroscopy is applied in fields ranging 
from chemistry to conservation. It is ideal 
for very fine-resolution work, such as deter- 
mining the shift of spectral lines in Earth’s 
gravitational field. It has even been used 
beyond Earth — to analyse the composi- 
tion of rocks on Mars. 

Born in 1929 in Munich, Germany, 
Mossbauer was a talented piano player but 
decided to study physics at the Technical 
Institute of Munich (now the Technical 
University of Munich). His prizewinning 
discovery emerged from his PhD work 
investigating y-ray absorption, done 
between 1955 and 1958 under Heinz Maier- 
Leibnitz at the Max Planck Institute for 
Medical Research in Heidelberg. In 1960, he 
moved to the California Institute of Technol- 
ogy (Caltech) in Pasadena, and received the 
Nobel prize a year later. 

But he didn’t stay in America for long, 
moving back to the Technical Institute of 
Munich in 1964. The Bavarian government 
was willing to pay a high price for his return 
—anew physics building and ten professor- 
ships. But the gain for physics in Germany 
was tremendous. Méssbauer’s new depart- 
ment brought a blast of fresh air to Munich 
and enhanced the 


scientific climate “Ultimately, 
nationwide. Several Méssbauer 
German scientistswho Saw science as 
worked in the United the language 
States came back to connecting all 
Germany in what the people of 
was called the ‘second fhe world.” 


Mossbauer effect. 

Mossbauer remained in Munich until 
his retirement in 1997. Between 1972 and 
1975, he took leave to serve as director of the 
Laue-Langevin Institute in Grenoble, France. 

The power of Mossbauer spectroscopy 
stems from two facts. Nuclei bound in a 
solid (a crystal, glass or viscous liquid) are 
unaffected (with some probability) by recoil 
when they emit or absorb y-radiation, and so 
do not gain or lose energy. Moreover, the fre- 
quency range of these y-rays — the natural 


width — is determined by the nuclear life- 
time and is not broadened by the Doppler 
effect. The spread of energies absorbed or 
emitted by a particular isotope is extremely 
narrow, giving the technique its high energy 
resolution. 

Mossbauer realized that y-rays emitted 
by a nucleus could be reabsorbed by nuclei 
of the same isotope in another sample. Any 
difference between the environment of the 
two nuclei causes an energy shift. Such shifts 
yield information about the state of the elec- 


trons of the isotope, and hence the nature 
of the solid containing it. Properties such as 
magnetism and flexibility (of protein mol- 
ecules, for example) can be derived. 

Applying his discovery across many fields, 
MOssbauer and his colleagues produced 
an explosion of papers in nuclear physics, 
solid-state physics, chemistry and biophys- 
ics — especially when the Mossbauer effect 
on iron-57 was found. Other Méssbauer iso- 
topes were detected, such as tantalum-181, 
which has an extraordinarily narrow natural 
linewidth and thus fine energy resolution. 
Measurements on iron and tin compounds 
led to breakthroughs on their valence and 
electronic configurations. 

Once such experiments became ‘routine, 
MOssbauer’s interest shifted. He stud- 
ied neutrinos through his involvement 
in the GALLEX experiments at the Gran 
Sasso National Laboratory in Italy. And he 
turned his spectroscopic method to exotic 


applications, such as the structure and 
dynamics of proteins, and reflections and 
anomalous transmissions in perfect single 
crystals. 

But ultimately, Méssbauer saw science as 
the language connecting all the people of 
the world. He set up many international col- 
laborations during his career. Strong links 
with Russia began when, during his time 
at Caltech, scientists at the USSR Academy 
of Sciences invited American physicists to 
Moscow to discuss possible cooperation. 
Because the cold war was still going on, no 
prominent American scientist wanted to 
go — so Méssbauer did. That visit was the 
starting point of long-running exchanges 
with Vitalii Goldanskii at the academy’s 
Institute of Chemical Physics and with Yuri 
Kagan at the Kurchatov Institute in Mos- 
cow. Méssbauer became a foreign member 
of the USSR Academy of Sciences in 1982. 


THE TRAIN FROM TEHRAN 

After Méssbauer returned to Munich, he 
and Goldanskii exchanged students and 
scientists. They organized the first common 
seminar in Moscow in 1975, out of which 
grew a series held alternately in the Soviet 
Union and Germany. In 1977, one was to 
be held in Yerevan, Armenia. When he was 
invited to visit Iran during the week before 
the seminar, Méssbauer looked at the map 
and decided to go by train from Tehran to 
Yerevan, not realizing that the train would 
stop in the desert between the two cities. His 
delegation waited in Yerevan for two days for 
their lost chief. Méssbauer also maintained 
excellent contacts with many other coun- 
tries, especially the United States. 

Mossbauer placed great value on teaching. 
He gave brilliant lectures, and never went to 
conferences during the school term. He gave 
his students — including me — freedom, but 
he helped when problems arose. 

When the University of Miinster offered 
me a professorship in 1982, Méssbauer 
decided that my research field should go 
with me. He generously arranged for me 
to take all of my equipment, so that I could 
make the best start. Always collaborative, 
he did not wish to compete with a former 
student and colleague. m 


Fritz Parak is a professor (retired) in the 
Department of Physics, Technical University 
of Munich, 85748 Garching, Germany. 
e-mail: fritz.parak@ph.tum.de 
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PARTICLE PHYSICS 


Sterile neutrinos 


An analysis of neutrino data suggests that there may be additional types of neutrinos beyond the three currently known. 
If confirmed, the existence of these additional particles could have an impact on astrophysics and cosmology. 


WILLIAM C. LOUIS 


eutrinos are the second most abundant 

particle type in the Universe, after 

photons. Yet little is known about 
them. In particular, neutrino masses are not 
known, nor is the number of neutrino types. 
Writing in Physical Review Letters, Kopp 
et al.' analysed all the available neutrino 
data and suggest that there may be addi- 
tional neutrino types beyond the three 
active neutrinos — the electron neutrino, 
muon neutrino and tau neutrino — that inter- 
act with matter by the weak interaction. These 
additional neutrino types would interact only 
by gravity and not by the weak interaction, 
and would therefore be ‘sterile. Through their 
mixing with active neutrinos and their inter- 
actions with gravity, these sterile neutrinos 
could have a big effect on astrophysics and 
cosmology. 

The number of active neutrinos has been 
determined to be three from measurements 
of the width of an elementary particle known 
as the Z° boson, performed at the Large Elec- 
tron—Positron (LEP) collider at CERN near 
Geneva in Switzerland. The Z° boson medi- 
ates the weak interaction, so it can decay into 
all of the quarks and leptons that interact by 
the weak interaction (quarks and leptons are 
two families of elementary particles; the latter 
includes the neutrinos). The width of the Z° is 
proportional to the number of particle decay 
channels and inversely proportional to the 
Z lifetime. Any additional types of neutrino 
would have to be sterile neutrinos so that they 
would not interact by the weak interaction and 
would not affect the width of Z°. 

How do we search for sterile neutrinos? 
One way is to look for cosmological evidence 
of sterile neutrinos through their gravita- 
tional effects on galaxy formation and the 
evolution of the Universe. However, this tech- 
nique assumes that we know the density of 
sterile neutrinos in the Universe and is model- 
dependent. A more direct method is to search 
for oscillations between active and sterile 
neutrinos. 

Neutrino oscillations, observed through the 
transmutation of neutrinos of one type into 
neutrinos of another type, occur if there is mix- 
ing between neutrino types and if individual 
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Figure 1 | Neutrino quintet. The best-fit model 
to the world neutrino data analysed by Kopp et al.' 
makes use of five neutrino types (three active and 
two sterile) and five different neutrino masses 

(V1, V2, V3, V4, Vs)> Which are shown here but are 

not to scale. The proportions of the three active 
neutrino types that contribute to the different 
masses are represented by grey (electron neutrino), 
blue (muon neutrino), and red (tau neutrino), 
whereas those of the two sterile neutrino types 

are represented by yellow and white. 


neutrino types consist of a linear combination 
of different neutrino masses. (At present, the 
masses and mixings of the fundamental quarks 
and leptons can be measured but are not fully 
understood.) In the case of two-neutrino mix- 
ing — for example, mixing between the muon 
neutrino and the electron neutrino — the 
probability (P) that a muon neutrino (v,,) will 
oscillate into an electron neutrino (v,) is given 
by P(vy,> v,.) = sin?(26)sin?(1.27Am’L/E). 
Here, 6, in radians, describes the mixing 
between the muon neutrino and electron 
neutrino; Am’ is the difference of the squares 
of the masses of the two neutrinos in square 
electronvolts; L is the distance travelled by 
the muon neutrino in kilometres; and E is the 
muon-neutrino energy in gigaelectronvolts. 
In general, the number of different neutrino 
masses equals the number of neutrino types, 
so that three-neutrino mixing involves three 
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neutrino masses and two independent Am* 
values, whereas five-neutrino mixing involves 
five neutrino masses and four independent 
Am’ values. Neutrino oscillations have been 
observed at a Am’ of about 7x10 eV’ by 
detectors that measure the flow of neutrinos 
from the Sun and experiments that detect 
neutrinos at a long distance from nuclear 
reactors. The oscillations have been detected 
at a Am’ of around 2x10~ eV’ by detectors 
that measure the flow of neutrinos from the 
atmosphere and by experiments in which 
neutrinos are measured at a long distance 
from particle accelerators. In addition to these 
confirmed observations of neutrino oscilla- 
tions, there is also evidence for oscillations 
at a Am’ of about 1 eV’ from short-distance 
accelerator and reactor neutrino experi- 
ments” *. However, it is not possible to explain 
this third Am’ value with only three neu- 
trino masses. Therefore, additional neutrino 
masses are required. 

In their study, Kopp et al.' tried fitting the 
world neutrino-oscillation data to theoreti- 
cal models involving four different neutrino 
masses (three active neutrinos plus one ster- 
ile neutrino) and then five different neutrino 
masses (three active plus two sterile neutrinos; 
Fig. 1). They found that one sterile neutrino 
was insufficient to explain the world data, but 
two gave a satisfactory global fit. (Similar fits 
are discussed elsewhere” ’°.) One other fea- 
ture of the authors’ two-sterile-neutrino fit 
is that it allows for violation in leptons of the 
charge-parity (CP) symmetry — according 
to which particles and antiparticles behave 
like mirror images of each other — or for a 
difference between neutrino oscillations and 
antineutrino oscillations. Such CP violation 
might help to explain the r-process, in which 
heavy elements are produced through nuclear 
reactions involving rapid neutron capture 
(hence the ‘r’), and the production of heavy 
elements in neutrino bursts from stellar explo- 
sions known as supernovae. It might also help 
to explain why the Universe is dominated by 
matter and not by an equal amount of matter 
and antimatter. 

Does this mean that sterile neutrinos exist? 
Before we can begin to answer this question, 
we must also realize that although the best-fit 
probability is good, two tensions exist in the 


data: that between appearance and disappear- 
ance oscillation experiments (appearance 
experiments measure the appearance of a 
second neutrino type, whereas disappearance 
experiments measure the disappearance of the 
initial neutrino type); and the tension between 
global fits to neutrino oscillation data and to 
cosmological data that measure the total neu- 
trino mass in the Universe (the best fit from 
Kopp and colleagues’ data has a total neutrino 
mass of about 1.7 eV, whereas cosmological 
data set a mass limit of less than 0.7-1.5 eV). 
This latter tension is especially important 
because, as Kopp et al. state, sterile neutrino 
explanations “would indicate a deviation from 
the standard cosmological picture”. 

For the future, it will be crucial to test this 
evidence for sterile neutrinos with new and 


TUMOUR BIOLOGY 


improved experiments. Indeed, several experi- 
ments being planned or already running in 
the United States, Europe and Asia will have 
sensitivity to sterile neutrinos and to masses 
of about 1 eV. These include experiments that 
make use of a radioactive neutrino source, 
reactor-neutrino experiments, accelerator- 
neutrino experiments and even the IceCube 
experiment at the South Pole, which detects 
atmospheric neutrinos of very high energy 
(10” eV). If their existence is confirmed, 
sterile neutrinos could have a major impact on 
nuclear physics, particle physics, astrophysics 
and cosmology. It would be ironic if a parti- 
cle that interacts so weakly could affect the 
Universe so strongly. = 


William C. Louis is in the Physics Division, 


Skin-cancer stem 
cells outwitted 


Skin-cancer stem cells secrete a factor that organizes a blood-supply system 
to fuel tumour growth. But the same factor has another sinister function — it 
stimulates the stem cells to propagate uncontrollably. SEE LETTER P.399 


SALVADOR AZNAR BENITAH 


r | Vhe most frequently diagnosed cancers 
in industrialized countries are non- 
melanoma skin cancers, including 

squamous-cell and basal-cell carcinomas’. 

Like most other solid tumours, squamous-cell 

carcinomas harbour a subset of cells known 

as cancer stem cells**, which initiate and 
propagate the tumour by a hitherto unknown 
mechanism. On page 399 of this issue, Beck 
and colleagues’ reveal that these cells secrete 
copious amounts of a growth factor that uses 

a two-pronged strategy to ensure that tumour 

growth continues indefinitely. They show that 

obstructing this dual activity causes the pool 
of cancer stem cells to shrink and the tumour 
to regress. 

The growth factor that Beck et al. find to be 
secreted in large quantities by skin cancer stem 
cells (CSCs) is VEGE In a process known as 
angiogenesis, VEGF attracts endothelial cells 
that line blood vessels and stimulates their pro- 
liferation, creating a vascular network to sup- 
ply the growing tumour with essential oxygen 
and nutrients’. VEGF signals to endothelial 
cells by binding to a specific receptor, known 
as VEGFR2, on the cell membrane (Fig. 1). 
When the authors used antibodies against 
VEGEFR2 to block the receptor in mice with 
skin tumours, this not only prevented angio- 
genesis, but also caused the pool of CSCs to 


decrease, which retarded tumour growth. 

CSCs were also able to transduce sig- 
nals from VEGF with the help of a trans- 
membrane co-receptor called neuropilin-1 
(Nrp 1) that works with the primary VEGF 
receptor (VEGFR). Through Nrp1, VEGF 
stimulates the expression of genes that are 
enriched in CSCs — for example, those 
responsible for cell proliferation, such as 
cyclin D1, as well as genes such as Sox2 that 
confer the unique features of stem cells, or 
‘stemness. Cutaneous CSCs therefore renew 
themselves in response to the VEGF they 
secrete — a cell-autonomous, self-communi- 
cating mechanism referred to as an autocrine 
loop (Fig. 1). Strikingly, VEGF causes the CSC 
pool to increase even in the absence of a new 
blood-supply network. 

The dual response of CSCs to VEGF could 
explain these cells’ ability to both initiate and 
propagate tumours. These same pathways may 
enable CSCs to re-form a complete carcinoma 
when transplanted in low numbers, or even 
as single cells, in experimental mouse mod- 
els of skin cancer””’, and allow them to leave 
a patient’s primary tumour and establish a 
metastatic tumour in a distant organ. 

Inhibiting VEGF signalling has already been 
shown to reduce squamous tumour initiation 
in vivo’. And, as Beck et al.* show, selective 
inhibition of VEGF signalling in CSCs, but 
not vascular cells, causes a drastic drop in CSC 
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Los Alamos National Laboratory, Los Alamos, 
New Mexico 87545, USA. 
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Figure 1 | Two-pronged effects of VEGF. Beck 
et al." find that skin-cancer stem cells secrete large 
quantities of the growth factor VEGE This protein 
binds to the VEGFR2 receptor on the surface of 
endothelial cells that line nearby blood vessels, 
promoting angiogenesis. VEGF also seems to bind 
to the VEGF receptor VEGFR and its co-receptor, 
Nrp1, on the surface of cancer stem cells 
themselves, further stimulating tumour growth 
through this autocrine mechanism. 
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data: that between appearance and disappear- 
ance oscillation experiments (appearance 
experiments measure the appearance of a 
second neutrino type, whereas disappearance 
experiments measure the disappearance of the 
initial neutrino type); and the tension between 
global fits to neutrino oscillation data and to 
cosmological data that measure the total neu- 
trino mass in the Universe (the best fit from 
Kopp and colleagues’ data has a total neutrino 
mass of about 1.7 eV, whereas cosmological 
data set a mass limit of less than 0.7-1.5 eV). 
This latter tension is especially important 
because, as Kopp et al. state, sterile neutrino 
explanations “would indicate a deviation from 
the standard cosmological picture”. 

For the future, it will be crucial to test this 
evidence for sterile neutrinos with new and 
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improved experiments. Indeed, several experi- 
ments being planned or already running in 
the United States, Europe and Asia will have 
sensitivity to sterile neutrinos and to masses 
of about 1 eV. These include experiments that 
make use of a radioactive neutrino source, 
reactor-neutrino experiments, accelerator- 
neutrino experiments and even the IceCube 
experiment at the South Pole, which detects 
atmospheric neutrinos of very high energy 
(10” eV). If their existence is confirmed, 
sterile neutrinos could have a major impact on 
nuclear physics, particle physics, astrophysics 
and cosmology. It would be ironic if a parti- 
cle that interacts so weakly could affect the 
Universe so strongly. = 


William C. Louis is in the Physics Division, 


Skin-cancer stem 
cells outwitted 


Skin-cancer stem cells secrete a factor that organizes a blood-supply system 
to fuel tumour growth. But the same factor has another sinister function — it 
stimulates the stem cells to propagate uncontrollably. SEE LETTER P.399 


SALVADOR AZNAR BENITAH 


r | Vhe most frequently diagnosed cancers 
in industrialized countries are non- 
melanoma skin cancers, including 

squamous-cell and basal-cell carcinomas’. 

Like most other solid tumours, squamous-cell 

carcinomas harbour a subset of cells known 

as cancer stem cells**, which initiate and 
propagate the tumour by a hitherto unknown 
mechanism. On page 399 of this issue, Beck 
and colleagues’ reveal that these cells secrete 
copious amounts of a growth factor that uses 

a two-pronged strategy to ensure that tumour 

growth continues indefinitely. They show that 

obstructing this dual activity causes the pool 
of cancer stem cells to shrink and the tumour 
to regress. 

The growth factor that Beck et al. find to be 
secreted in large quantities by skin cancer stem 
cells (CSCs) is VEGE In a process known as 
angiogenesis, VEGF attracts endothelial cells 
that line blood vessels and stimulates their pro- 
liferation, creating a vascular network to sup- 
ply the growing tumour with essential oxygen 
and nutrients’. VEGF signals to endothelial 
cells by binding to a specific receptor, known 
as VEGFR2, on the cell membrane (Fig. 1). 
When the authors used antibodies against 
VEGEFR2 to block the receptor in mice with 
skin tumours, this not only prevented angio- 
genesis, but also caused the pool of CSCs to 


decrease, which retarded tumour growth. 

CSCs were also able to transduce sig- 
nals from VEGF with the help of a trans- 
membrane co-receptor called neuropilin-1 
(Nrp 1) that works with the primary VEGF 
receptor (VEGFR). Through Nrp1, VEGF 
stimulates the expression of genes that are 
enriched in CSCs — for example, those 
responsible for cell proliferation, such as 
cyclin D1, as well as genes such as Sox2 that 
confer the unique features of stem cells, or 
‘stemness. Cutaneous CSCs therefore renew 
themselves in response to the VEGF they 
secrete — a cell-autonomous, self-communi- 
cating mechanism referred to as an autocrine 
loop (Fig. 1). Strikingly, VEGF causes the CSC 
pool to increase even in the absence of a new 
blood-supply network. 

The dual response of CSCs to VEGF could 
explain these cells’ ability to both initiate and 
propagate tumours. These same pathways may 
enable CSCs to re-form a complete carcinoma 
when transplanted in low numbers, or even 
as single cells, in experimental mouse mod- 
els of skin cancer””’, and allow them to leave 
a patient’s primary tumour and establish a 
metastatic tumour in a distant organ. 

Inhibiting VEGF signalling has already been 
shown to reduce squamous tumour initiation 
in vivo’. And, as Beck et al.* show, selective 
inhibition of VEGF signalling in CSCs, but 
not vascular cells, causes a drastic drop in CSC 
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Figure 1 | Two-pronged effects of VEGF. Beck 
et al." find that skin-cancer stem cells secrete large 
quantities of the growth factor VEGE This protein 
binds to the VEGFR2 receptor on the surface of 
endothelial cells that line nearby blood vessels, 
promoting angiogenesis. VEGF also seems to bind 
to the VEGF receptor VEGFR and its co-receptor, 
Nrp1, on the surface of cancer stem cells 
themselves, further stimulating tumour growth 
through this autocrine mechanism. 
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population and regression of the tumour. 

The CSCs in squamous tumours that 
Beck et al. studied were predominantly in 
niches closely associated with the underly- 
ing endothelial cells. Leukaemia and brain- 
tumour stem cells are also often found side 
by side with vascular cells’. There is strong 
evidence that CSCs ‘read’ factors released by 
neighbouring endothelial cells and that these 
factors are necessary for CSC maintenance 
(paracrine communication)*”, perhaps work- 
ing in synergy with the autocrine signalling 
through CSC-derived VEGF to sustain tumour 
stemness (Fig. 1). 

These findings raise several interesting 
questions. For instance, heterogeneous popu- 
lations of CSCs have been described’ in squa- 
mous-cell carcinomas that have comparable 
tumour-promoting potential but different 
proliferation rates. CSC heterogeneity is also 
evident in leukaemias and in solid tumours 
such as lung carcinomas’*”’. Are different 
CSC subtypes localized close to the vascular 
niche? Do other tumour types rely on the 
VEGEF autocrine loop? Does Nrp1 act alone or 
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does it cooperate with different primary VEGF 
receptors? One might also ask whether CSCs 
are equally dependent on autocrine VEGF 
and on communicating with neighbouring 
endothelial cells for their self-renewal, and 
whether VEGF is still as important in the later 
stages of cancer as it is in the initial stages. 
Developing strategies that precisely target 
the self-sustaining mechanism of CSCs — 
rather than blanket prevention of angio- 
genesis — might be therapeutically useful. 
One adverse side effect of anti-angiogenic 
cancer therapies is the deprivation of oxygen 
in tumours, which paradoxically enhances 
their metastatic potential’*. Therapies that 
preferentially inhibit VEGF signalling in 
CSCs — for instance by preventing inter- 
action between Nrpl and VEGF — might 
selectively prevent stem-cell self-renewal 
without creating pro-metastatic oxygen short- 
age. Although most of the vascularly secreted 
paracrine factors that affect CSC self-renewal 
are yet to be identified, blocking their activ- 
ity might also selectively eliminate CSCs 
with little anti-angiogenic effect. Beck and 


Deft tricks with 
liquid crystals 


Some biological macromolecules can control their own assembly into elegant 
hierarchical structures. Synthetic supramolecules are catching up fast, 
promising new advances for optical and biomedical materials. SEE LETTER P.364 


IVAN I. SMALYUKH 


Le crystals are widely used in electro- 


optics, photonics, sensors, artificial 

muscles and even in laboratory model- 
ling of the early Universe’ *. But even these 
applications are nowhere near as complex 
and important as the use of liquid crystals in 
biological systems, where they underpin the 
organization of both soft and hard tissues. On 
page 364 of this issue*, Chung et al. borrow 
an ingenious trick from biology to produce 
highly structured biomimetic materials from 
liquid crystals. Their discovery of how to fine- 
tune the assembly of such complex structures 
from identical building blocks could help to 
accelerate the large-scale fabrication of new 
functional materials. 

The simplest liquid crystals are typically 
rod-like molecules that spontaneously orient 
along a common direction while flowing 
under shear or gravity’. The chirality (hand- 
edness) of such molecules and their various 
degrees of positional ordering introduce a 
plethora of different phases that have unique 


combinations of order and fluidity. Biological 
tissues often resemble crystalline structures 
or liquid-crystal flow patterns frozen in time’, 
but the physical origins of their organization 
are not well understood. The synthetic tem- 
plating of biomimetic structures by Chung 
et al.* provides valuable insight into these 
processes. 

The authors use a rod-shaped bacteriophage 
(a virus that specifically infects bacteria), 
known as M13, as their starter building 
block. They dip-coat plates in dispersions of 
unoriented anisotropic phage particles while 
tuning the concentration and speed of extrac- 
tion of submerged glass substrates, so that, as 
the water evaporates, the particles concen- 
trate at the triple contact line of water, glass 
and air (Fig. 1a). The particles are carried to 
the meniscus by capillary flow to maintain the 
contact angle by replenishing the fast evapora- 
tive water losses near the pinned contact line. 
Such flows are ubiquitous in everyday life and 
cause, for example, the ‘coffee-ring’ stains in 
drying drops®”. As the concentration increases, 
the phage particles spontaneously align and 
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colleagues’ promising findings warrant further 
investigation into the molecular mechanism 
and therapeutic potential of blocking the 
autocrine VEGF/Nrp1 loop. = 
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form different phases (Fig. 1b,c). When the 
contact line recedes while the substrate is being 
extracted, the liquid-crystal structures become 
further enriched by flows and by the action of 
capillary forces, and are then imprinted in the 
structure of the ensuing film (Fig. 1d). 

By changing the speed of plate extraction 
and the initial concentration of phage par- 
ticles, Chung et al. obtain highly controlled 
structures that mimic various liquid-crystal 
phases, as well as complex patterns generated 
by the combined effects of confinement geom- 
etry, flow and the receding contact line. The 
patterns closely resemble those seen in some 
biological tissues’, supporting the idea that 
such tissues are ‘frozen liquid crystals’. 

The discontinuous receding and pinning of 
the contact line are important. Phage particles 
spontaneously align parallel to a pinned con- 
tact line to maximize their overall translational 
and rotational entropy in the wedge-shaped 
meniscus region. This gives rise to liquid- 
crystal phases that depend on concentration’, 
yielding both uniform and twisted structures 
that minimize the elastic energy of liquid 
crystals confined in the wedge-shaped 
menisci (Fig. 1b,c). 

The receding of the contact line as a result 
of pulling the substrate exerts ‘combing’ 
forces, which act on anisotropic particles at the 
fluid—air interface to minimize the surface- 
tension energy’; these forces tend to align 
the rods along the receding direction. As the 
contact-line dynamics are discontinuous and 
dependent on pulling speed, the interplay of 
these antagonistic entropic/elastic and comb- 
ing effects provides a simple means to control 
the biomimetic self-assembly of a host of com- 
plex structures (Fig. 1d). The authors* show 
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population and regression of the tumour. 

The CSCs in squamous tumours that 
Beck et al. studied were predominantly in 
niches closely associated with the underly- 
ing endothelial cells. Leukaemia and brain- 
tumour stem cells are also often found side 
by side with vascular cells’. There is strong 
evidence that CSCs ‘read’ factors released by 
neighbouring endothelial cells and that these 
factors are necessary for CSC maintenance 
(paracrine communication)*”, perhaps work- 
ing in synergy with the autocrine signalling 
through CSC-derived VEGF to sustain tumour 
stemness (Fig. 1). 

These findings raise several interesting 
questions. For instance, heterogeneous popu- 
lations of CSCs have been described’ in squa- 
mous-cell carcinomas that have comparable 
tumour-promoting potential but different 
proliferation rates. CSC heterogeneity is also 
evident in leukaemias and in solid tumours 
such as lung carcinomas’*”’. Are different 
CSC subtypes localized close to the vascular 
niche? Do other tumour types rely on the 
VEGEF autocrine loop? Does Nrp1 act alone or 
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does it cooperate with different primary VEGF 
receptors? One might also ask whether CSCs 
are equally dependent on autocrine VEGF 
and on communicating with neighbouring 
endothelial cells for their self-renewal, and 
whether VEGF is still as important in the later 
stages of cancer as it is in the initial stages. 
Developing strategies that precisely target 
the self-sustaining mechanism of CSCs — 
rather than blanket prevention of angio- 
genesis — might be therapeutically useful. 
One adverse side effect of anti-angiogenic 
cancer therapies is the deprivation of oxygen 
in tumours, which paradoxically enhances 
their metastatic potential’*. Therapies that 
preferentially inhibit VEGF signalling in 
CSCs — for instance by preventing inter- 
action between Nrpl and VEGF — might 
selectively prevent stem-cell self-renewal 
without creating pro-metastatic oxygen short- 
age. Although most of the vascularly secreted 
paracrine factors that affect CSC self-renewal 
are yet to be identified, blocking their activ- 
ity might also selectively eliminate CSCs 
with little anti-angiogenic effect. Beck and 
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optics, photonics, sensors, artificial 

muscles and even in laboratory model- 
ling of the early Universe’ *. But even these 
applications are nowhere near as complex 
and important as the use of liquid crystals in 
biological systems, where they underpin the 
organization of both soft and hard tissues. On 
page 364 of this issue*, Chung et al. borrow 
an ingenious trick from biology to produce 
highly structured biomimetic materials from 
liquid crystals. Their discovery of how to fine- 
tune the assembly of such complex structures 
from identical building blocks could help to 
accelerate the large-scale fabrication of new 
functional materials. 
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rod-like molecules that spontaneously orient 
along a common direction while flowing 
under shear or gravity’. The chirality (hand- 
edness) of such molecules and their various 
degrees of positional ordering introduce a 
plethora of different phases that have unique 


combinations of order and fluidity. Biological 
tissues often resemble crystalline structures 
or liquid-crystal flow patterns frozen in time’, 
but the physical origins of their organization 
are not well understood. The synthetic tem- 
plating of biomimetic structures by Chung 
et al.* provides valuable insight into these 
processes. 

The authors use a rod-shaped bacteriophage 
(a virus that specifically infects bacteria), 
known as M13, as their starter building 
block. They dip-coat plates in dispersions of 
unoriented anisotropic phage particles while 
tuning the concentration and speed of extrac- 
tion of submerged glass substrates, so that, as 
the water evaporates, the particles concen- 
trate at the triple contact line of water, glass 
and air (Fig. 1a). The particles are carried to 
the meniscus by capillary flow to maintain the 
contact angle by replenishing the fast evapora- 
tive water losses near the pinned contact line. 
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colleagues’ promising findings warrant further 
investigation into the molecular mechanism 
and therapeutic potential of blocking the 
autocrine VEGF/Nrp1 loop. = 
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Figure 1 | Self-assembly of liquid-crystal structures. Chung et al.’ have 
grown films in a single-step process in which substrates are dipped in and 

out of suspensions of rod-like particles (phage viruses) in water. a, When a 
glass substrate is withdrawn from a suspension, rapid water evaporation (red 
arrows) occurs at the contact line formed at the interface of the suspension, 
the substrate and the surrounding air. This evaporation, coupled with the need 
for the contact angle between the surface of the water and the substrate to be 
maintained as the substrate is withdrawn, causes capillary flow (blue arrows) 
that replenishes water at the contact line and carries particles to the meniscus. 


that this process can yield biomimetic films 
that have pigment-free structural colours. 
These structured films are of interest for use 
as reflectors and filters, for providing peri- 
odic patterns for applications in unconven- 
tional diffraction gratings, and for controlling 
mechanical properties of artificial materials 
that resemble those of tissues found in nature. 

In conventional liquid crystals, such as 
those used in mobile-phone displays, these 
structures would be unstable in the absence of 
external fields, relaxing to the ground state of 
the corresponding phases. In the set-up used 
by Chung et al., however, water drying and 
the corresponding loss of fluidity imprint the 
liquid crystals into the biomimetic morphol- 
ogy of thin structured films. Contact-line 
pinning and discontinuous receding are 
usually evident during natural water drying 
at menisci and during droplet motion under 
gravity, offering clues to the origins of many 
forms of natural materials and tissues. 

This type of biomimetic self-templating 
could extend the scope of application of 
many powerful fabrication techniques based 
on liquid-crystal self-organization and dip- 
coating””'’. As liquid-crystal behaviour is 
not restricted to rod-shaped particles, new 
liquid-crystal phases may be discovered in 
dispersions of particles that have low sym- 
metry. It will be of great interest, both from a 
fundamental viewpoint and for potential appli- 
cations, to explore how the approach taken by 
Chung et al.*can be extended to dispersions of 
particles of varying composition, shape, rigid- 
ity and size, as well as to composite dispersions 
comprising different types of particles. 

Biomimetic self-templating has the poten- 
tial to be used to engineer tissues to acquire 
sophisticated structure, to investigate 
links between liquid-crystal self-organiza- 
tion and morphogenesis” and to fabricate 


metamaterials, photonic crystals and organic 
photovoltaics, adding to an already impressive 
list of applications. m 
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b, Particles concentrated at the meniscus spontaneously form a ‘nematic 
liquid crystal in which the particles align with the contact line. c, Interactions 
between highly concentrated particles promote their assembly into twisted, 
helical structures. The wedge-shaped geometry of the suspension just below 
the contact line causes the particles to form features called Bouligand arches 
(red lines). d, The discontinuous receding of the contact line as the substrate is 
withdrawn (green arrow), and rapid water evaporation at the line, promote the 
mechanical imprinting of self-organized liquid-crystal patterns into the film 
that forms as particles are deposited on the substrate. 
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Stars acquire youth 
through duplicity 


The origin of unusually hot stars in a sparse cluster has been attributed to their 
being members of binary systems rather than stellar collisions. This prompts a 
rethink of how stars merge when they collide. SEE LETTER P.356 


CHRISTOPHER TOUT 


tars in dense clusters were mostly born 

at the same time, but a few seem to 

be much younger. The origin of these 
oddly young stars, which are known as blue 
stragglers, has been attributed to collisions 
between stars in the dense stellar environ- 
ment or to the accretion of matter from a close 
companion. On page 356 of this issue, Geller 
and Mathieu’ examine current compan- 
ions to blue stragglers and establish that the 


mass-transfer origin is prevalent in a sparse 
star cluster known as NGC 188. 

Two properties of any star — its bright- 
ness, or magnitude, and colour — can be eas- 
ily measured. Ifa star’s distance is known, its 
luminosity can be calculated from its magni- 
tude. Its colour is due to its temperature. In a 
plot of luminosity against temperature, known 
as a Hertzsprung—Russell diagram, stars lie in 
specific regions (Fig. 1). The most populous 
area is known as the main sequence, in which 
the hottest blue stars are the brightest and the 
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Figure 1 | Self-assembly of liquid-crystal structures. Chung et al.’ have 
grown films in a single-step process in which substrates are dipped in and 

out of suspensions of rod-like particles (phage viruses) in water. a, When a 
glass substrate is withdrawn from a suspension, rapid water evaporation (red 
arrows) occurs at the contact line formed at the interface of the suspension, 
the substrate and the surrounding air. This evaporation, coupled with the need 
for the contact angle between the surface of the water and the substrate to be 
maintained as the substrate is withdrawn, causes capillary flow (blue arrows) 
that replenishes water at the contact line and carries particles to the meniscus. 


that this process can yield biomimetic films 
that have pigment-free structural colours. 
These structured films are of interest for use 
as reflectors and filters, for providing peri- 
odic patterns for applications in unconven- 
tional diffraction gratings, and for controlling 
mechanical properties of artificial materials 
that resemble those of tissues found in nature. 

In conventional liquid crystals, such as 
those used in mobile-phone displays, these 
structures would be unstable in the absence of 
external fields, relaxing to the ground state of 
the corresponding phases. In the set-up used 
by Chung et al., however, water drying and 
the corresponding loss of fluidity imprint the 
liquid crystals into the biomimetic morphol- 
ogy of thin structured films. Contact-line 
pinning and discontinuous receding are 
usually evident during natural water drying 
at menisci and during droplet motion under 
gravity, offering clues to the origins of many 
forms of natural materials and tissues. 

This type of biomimetic self-templating 
could extend the scope of application of 
many powerful fabrication techniques based 
on liquid-crystal self-organization and dip- 
coating””'’. As liquid-crystal behaviour is 
not restricted to rod-shaped particles, new 
liquid-crystal phases may be discovered in 
dispersions of particles that have low sym- 
metry. It will be of great interest, both from a 
fundamental viewpoint and for potential appli- 
cations, to explore how the approach taken by 
Chung et al.*can be extended to dispersions of 
particles of varying composition, shape, rigid- 
ity and size, as well as to composite dispersions 
comprising different types of particles. 

Biomimetic self-templating has the poten- 
tial to be used to engineer tissues to acquire 
sophisticated structure, to investigate 
links between liquid-crystal self-organiza- 
tion and morphogenesis” and to fabricate 


metamaterials, photonic crystals and organic 
photovoltaics, adding to an already impressive 
list of applications. m 
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b, Particles concentrated at the meniscus spontaneously form a ‘nematic 
liquid crystal in which the particles align with the contact line. c, Interactions 
between highly concentrated particles promote their assembly into twisted, 
helical structures. The wedge-shaped geometry of the suspension just below 
the contact line causes the particles to form features called Bouligand arches 
(red lines). d, The discontinuous receding of the contact line as the substrate is 
withdrawn (green arrow), and rapid water evaporation at the line, promote the 
mechanical imprinting of self-organized liquid-crystal patterns into the film 
that forms as particles are deposited on the substrate. 
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Stars acquire youth 
through duplicity 


The origin of unusually hot stars in a sparse cluster has been attributed to their 
being members of binary systems rather than stellar collisions. This prompts a 
rethink of how stars merge when they collide. SEE LETTER P.356 


CHRISTOPHER TOUT 


tars in dense clusters were mostly born 

at the same time, but a few seem to 

be much younger. The origin of these 
oddly young stars, which are known as blue 
stragglers, has been attributed to collisions 
between stars in the dense stellar environ- 
ment or to the accretion of matter from a close 
companion. On page 356 of this issue, Geller 
and Mathieu’ examine current compan- 
ions to blue stragglers and establish that the 


mass-transfer origin is prevalent in a sparse 
star cluster known as NGC 188. 

Two properties of any star — its bright- 
ness, or magnitude, and colour — can be eas- 
ily measured. Ifa star’s distance is known, its 
luminosity can be calculated from its magni- 
tude. Its colour is due to its temperature. In a 
plot of luminosity against temperature, known 
as a Hertzsprung—Russell diagram, stars lie in 
specific regions (Fig. 1). The most populous 
area is known as the main sequence, in which 
the hottest blue stars are the brightest and the 
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cooler, redder stars are fainter. The 
Sun lies in this main-sequence 
region. 

Main-sequence stars quiescently 
convert hydrogen to helium in their 
cores by nuclear fusion. Fusion 
heats the star and so provides 
thermal pressure support against 
gravity. The heaviest, bluest stars 
are brighter and exhaust their cen- 
tral fuel supply first. At this point 
they leave the main sequence so 
that its bluemost end is not occu- 
pied. The bluest star of a cluster not 
to have left is said to be at the main- 
sequence turn-off. Beyond the 
turn-off, a star drastically changes 
in structure, with fusion moving 
from the core to a shell around it. 
As fusion continues, the shell eats 
outwards through the star, add- 
ing helium to the growing core as 
it goes. The star rapidly migrates 
to join the population of cool red 
giants of high luminosity and large 
radius that lie towards the red end 
of the main sequence. 

As observations of stars 
improved, researchers noticed’ that 
all clusters have a small fraction of 
stars — the blue stragglers — on the 
main sequence that are bluer than 
the turn-off. The origin of blue 
stragglers has puzzled astrono- 
mers ever since. They seem to be 
younger than other stars in the cluster because 
they are heavier, but have not yet evolved to 
red giants. They have been explained’ as close 
binary star systems. In sucha system, the heav- 
ier star evolves to a giant first. When the two 
stars are close enough, material at the surface 
of the giant becomes more attracted to, and 
flows on to, its companion. The donor loses its 
hydrogen-rich outer layers and collapses to a 
white dwarf star. The accretor grows to a mass 
that exceeds the main-sequence turn-off and 
becomes a blue straggler. However, its appar- 
ent youth is short-lived. Its increased mass 
means that it quickly exhausts its fuel, and it 
dies young for its original mass. 

One way to test this hypothesis would be 
to find a white dwarf companion to each blue 
straggler. However, white dwarfs are too faint 
to be seen directly. So Geller and Mathieu’ 
have looked for an indirect way to establish 
the mass of the companions. 

The strongest contending theory is that blue 
stragglers form when stars collide. Although 
clusters are not dense enough for many col- 
lisions to occur between single stars, binary 
stars in a cluster make larger targets and 
collisions with them often lead to the merg- 
ing of two stars*. Such a blue straggler may 
be ejected as a single star or left with a new 
companion. Because triple-star systems tend 
to be unstable and the lightest component is 
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Figure 1 | Blue stragglers in the star cluster NGC 188. Stars in NGC 188 
are plotted in a Hertzsprung—Russell diagram, in which the horizontal 

axis shows colour ranging from blue to red, and so temperature decreasing 
from left to right, and the vertical axis shows luminosity increasing from 
bottom to top. Most stars lie on the main-sequence region, but the older 
stars have evolved off this region to become red giants. More massive stars 
on the main sequence are bluer and a star at the bluest extent of the bulk 

of stars is said to be at the turn-off. Bluewards of the turn-off lie the blue 
straggler stars, which have not evolved to become red giants. Geller and 
Mathieu' report that many of these blue stragglers have a white dwarf 
companion. (Modified from ref. 8.) 


often ejected, these new companions tend 
to have masses larger than typical white 
dwarfs when the turn-off mass is above a solar 
mass Or so. 

In the densest clusters, most blue stragglers 
have been attributed to collisions between 
binary stars or binary and single stars. In 
sparser environments, known as open clus- 
ters, mass transfer in primordial binaries 
becomes more important. Today we can make 
detailed models of the dynamics of all the 
stars in an open cluster’ and include the evo- 
lution of individual and binary stars. We can 
predict the blue-straggler content and deduce 
that about half the blue stragglers in an open 
cluster derive from collisions and half from 
primordial binaries. 

The open cluster NGC 188 has 21 blue 
stragglers, 16 of which are known to be 
members of a binary system®. Their duplic- 
ity is confirmed by variations in the Doppler 
shift of the light from the blue straggler as it 
orbits its companion. The Doppler shift of an 
individual blue straggler is insufficient to 
calculate its companion’s mass because we 
do not know the orientation of the binary 
star’s orbit in space. However, if we make 
an assumption about the distribution of the 
masses of the companions, it is possible to 
predict the distribution of Doppler shifts, 
compare them with the observed Doppler 
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shifts and learn indirectly about 
the companions’ masses. 

Geller and Mathieu’ tested 
whether, in their theoretical model 
that best describes NGC 188, the 
distribution of the masses of com- 
panions to blue stragglers is consist- 
ent with the observed distribution 
of the blue stragglers’ Doppler 
shifts. It is not. They performed 
a statistical analysis known as a 
Kolmogorov-Smirnoyv test to com- 
pare their theoretical and observed 
Doppler shifts, and found that 
their modelled companion masses 
can be ruled out with more than 
99% confidence. However, the 
observed Doppler shifts are con- 
sistent with all of the companions 
being of about half a solar mass 
or just above. This mass is typical 
ofa white dwarf left at the end of the 
life of a star such as the Sun. Thus, 
in this cluster, a collisional origin 
for blue stragglers is much rarer 
than expected, and the authors’ 
study casts doubt on whether it 
occurs at all. 

So what is wrong with the theo- 
retical models? The modelling of 
the state of stars that are far from 
one another in a cluster is good 
because the dynamic interactions 
between them are precisely gov- 
erned by Newton's laws. However, 
for stars that are close to one another, the 
modelling is intricate, because processes of 
stellar interaction are complex and are yet to 
be fully unravelled. Sometimes we are sim- 
ply making an educated guess. In particular, 
theoretical understanding of what happens 
when stars collide and how they subsequently 
evolve is in its infancy’. Geller and Mathieu's 
finding is a solid indication that we must 
review our simple ideas of how stars merge 
when they collide. = 
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Gone with the wean 


Unlike in other mammals, neuron production in the subventricular region of the 
human brain becomes depleted in early infancy. This finding calls for a reassessment 
of the potential role of adult neurogenesis in health and disease. SEE LETTER P.382 


JON I. ARELLANO & PASKO RAKIC 


eurogenesis, the generation of neurons, 

is rare in the adult mammalian brain. 

Whereas other vertebrates, such as 
fish and amphibians, show extensive addi- 
tion of neurons and the capacity for neuronal 
regeneration in adulthood, mammals have 
mostly lost these abilities. In adult mammals, 
only two brain regions produce neurons: the 
dentate gyrus of the hippocampus and the sub- 
ventricular region of the lateral cerebral ven- 
tricle. Continuing neurogenesis is particularly 
robust in rodents’, and exists toa more modest 
extent in non-human primates’. It has there- 
fore been expected that similar neurogenic 
niches exist in the adult human brain, having 
possible roles in normal brain functioning — 
for instance, in learning and memory — as well 
as being a possible source of new neurons for 
neuronal replacement in brain injury and neu- 
rodegenerative disorders. On page 382 of this 
issue, Sanai et al.° provide evidence that such 
assumptions need to be reassessed. 

Of the two brain regions that retain the 
ability to generate neurons in adult mam- 
mals, the subventricular zone (SVZ), situated 
below the lateral cerebral ventricle, has been 
extensively characterized. Progenitors residing 
in this region divide to give rise to neuronal 
precursors that migrate through the rostral 
migratory stream (RMS) to the olfactory bulb’. 
There the cells differentiate into a type of neu- 
ron called a granule cell and become integrated 
into existing neural circuits (see Fig. 1a,b of 
the paper’). 

Sanai et al. show that the migratory cell 
streams departing from the SVZ are present in 
human newborns, but disappear by the eight- 
eenth month after birth. Previous reports’, 
including some from the same group’, had also 
indicated the absence of migratory streams 
leaving the SVZ in adult humans. A study® in 
2007, however, claimed to have detected an 
RMS in adult humans, with numerous cells 
migrating from the SVZ to the olfactory bulb. 
Although a subsequent paper by Sanai and col- 
leagues’ immediately exposed serious flaws 
in the methodology and conclusions of that 
study’, there has been considerable resistance 
in the field to accepting that the RMS is absent 
in adult humans. 

Sanai and co-workers’ most recent set of 
data’ closes the debate, providing strong evi- 
dence for the progressive depletion of neuronal 


production and migration during early infancy. 
These findings are particularly convincing 
because they come from the laboratory that 
discovered that neurons migrate as ‘chains’ of 
cells to the olfactory bulb and that has exten- 
sively characterized the RMS in adult mice’. 
For their analysis, the authors studied humans 
within the age range of 0-84 years. The brain 
specimens they collected came from brain 
sections that had been removed during surgery 
and from autopsied brains. They used extensive 
histological analysis and a battery of the most 
specific markers for detecting cell proliferation 
and for identifying neuronal and non-neuronal 
(for example, glial) cell lineages. 

The authors find that the newborn brain 
contains cells at the ventricular surface that 
have the morphology and molecular char- 
acteristics of neuronal progenitors, together 
with a significant number of cells expressing 
Ki67 — a marker of proliferating cells. More- 
over, they identify in the SVZ of newborns 
a large number of cells expressing DCX and 
PSA-NCAM, markers of immature migrating 
neurons, and some cells expressing EGFR, a 
marker of intermediate progenitors of neu- 
rons. Cells expressing DCX and PSA-NCAM 
were also present along the expected route of 
the RMS leading to the olfactory bulb. How- 
ever, Sanai and colleagues do not detect such 
features in infants older than 18 months, or in 
children and adults. It seems, therefore, that 
most, if not all, neurogenic activity in these 
areas of the human brain has disappeared by 
time infants are weaned. 

Unexpectedly, the authors also discovered 
that the RMS has a branch — they call it the 
medial migratory stream — that reaches the 
ventro-medial prefrontal cortex, a feature that 
has not been described in any non-human 
mammal after birth. This medial branch of 
the RMS begins to disappear earlier than 
the conventional RMS, by about 6 months of 
age, being no longer detectable by 8 months. 
Together, these data suggest that, in the human 
SVZ, the production of neurons that move via 
the RMS and the medial migratory stream isa 
developmental feature that only exists shortly 
after birth. This is similar to the more pro- 
longed postnatal neurogenesis and migration 
of granular cells in the cerebellum, which in 
humans lasts up to the age of two years’. 

Perhaps the most instructive lesson to be 
learnt from this study’ is that a feature or a 
cellular event found in some mammalian 
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50 Years Ago 


Despite some impressive 
demonstrations of root-pressure, 
‘pumping or positive pressure 

is generally thought to be of 

little importance for the ascent 
of sap in trees ... In ten species 
of palms at Calcutta I was able 

to detect and measure positive 
root-pressures throughout the 
year, sometimes enough to lift 
sap to heights exceeding that of 
the trees ... The positive pressure 
exceeds the negative pressure 
considerably. It often exceeds the 
plant’s needs ... The distal part 
of a cut root ofa fruited banana 
plant was seen to develop pressure. 
The plant was decapitated at 

2.25 m. above ground-level, and 
the cut root reconnected with a 
rubber tube. Other agencies such 
as transpiration are probably less 
important than root pressure. 
From Nature 21 October 1961 


100 Years Ago 


My attention has been directed 

to three elm trees at Ettington ... 
which it is said have been “killed 

by wasps.’ It appears that the wasps 
were attracted by the sweetness of 
the sap, and attacked the trees in 
such swarms, and so drained them 
of sap, that the death of the trees 
seems imminent, all the leaves 
having gone yellow long before the 
usual time. I should be glad to know 
if others have noticed attacks on 
elm trees, and whether the averred 
sweetness of the sap is due to some 
previous degenerative change in the 
tissues of the tree, or whether wasps 
would attack a normal tree if they 
could get access to the sap. The elms 
are all three comparatively young 
trees, and belong to the common 
variety. My informant tells me that 
he has previously noticed the same 
thing happen with an elm tree in 
one of his fields, which died the 
next winter. 

From Nature 19 October 1911 


species does not necessarily apply to all 
mammals, and especially not to humans. 
Despite great similarities in brain organiza- 
tion, there are also significant differences 
that could have profound functional conse- 
quences’”. Obviously, one such difference is a 
decrease in adult neurogenesis and neuronal 
turnover that seems to have accelerated with 
primate evolution and that may be beneficial 
for human mental capabilities’. Examination 
of apes such as chimpanzees should answer the 
question of whether reduced neurogenesis in 
the adult SVZ is a uniquely human trait or a 
more general trend in hominoids. The present 
work also underscores the importance of con- 
sidering species-specific differences in cellular 
mechanisms, and in their timing and function, 
in seemingly similar structures. 

The general decrease in adult neurogenesis 
with vertebrate evolution is associated with 
a diminished capacity for neuronal replace- 
ment and regeneration”. Reprogramming of 
neural progenitors in the SVZ and redirecting 
of neuronal migration to achieve regeneration 
in injured brain areas are already proving to 
be formidable challenges in rodents. The 


CHEMICAL BIOLOGY 


present results’, demonstrating the lack of 
significant neurogenesis and migration in the 
human SVZ beyond infancy, suggest that such 
strategies might be even more difficult to 
apply in our species. m 
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Many faces of a cancer- 
supporting protein 


The protein Hsp90 is a target of promising anticancer drugs. An analysis of the 
components of Hsp90 complexes in tumours reveals a path that may lead to 
predictive assays of drug sensitivity in cancer patients. 


JOHN F. DARBY & PAUL WORKMAN 


he protein Hsp90 is a molecular 
chaperone — it assists in the cor- 
rect folding of other cellular proteins. 
Many of these Hsp90 ‘client’ proteins are over- 
expressed and/or mutated in cancer and are 
involved in maintaining the cancerous state’. 
Hsp90 inhibition is therefore an attractive 
strategy for simultaneously blocking multi- 
ple abnormal pathways that are crucial for 
many tumour types’. Indeed, early clinical 
trials have confirmed the therapeutic poten- 
tial of this approach. But understanding the 
mechanism that ensures selective targeting of 
tumour cells by Hsp90 inhibitors, and finding 
biomarkers to predict which cancers will be 
most sensitive to such treatment, has proved 
challenging. Writing in Nature Chemical Biol- 
ogy, Moulick et al.* use chemical proteomics 
— acombination of chemical precipitation 
and multi-protein profiling — to shed light 
on these questions. 
There are some 20 Hsp90 inhibitors now 
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in clinical trials. Their effects have been most 
impressive in breast cancers that overexpress a 
highly sensitive Hsp90 client, the HER2 onco- 
protein, but that are resistant to the HER2- 
antibody drug trastuzumab*. They are also 
promising in non-small-cell lung tumours 
that express the mutated oncogenic pro- 
tein EML4-ALK, a similarly sensitive Hsp90 
client’. However, in other cancer types 
Hsp90 inhibition is less effective, despite the 
fact that the oncogenic constituents of such 
cancers are among Hsp90’s clientele. The best 
path to obtaining approval for widespread clin- 
ical use of these inhibitors may therefore be to 
apply them in particular tumour subtypes that 
are driven by highly Hsp90-dependent onco- 
proteins®. To predict an individual patient’s 
responsiveness, it will be crucial to define all 
of Hsp90’s oncoprotein clientele and to under- 
stand the make-up and function of chaperone- 
client complexes, together with the molecular 
networks in which they are involved. 
Large-scale genetic and messenger RNA/ 
protein-profiling efforts have revealed that 
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species does not necessarily apply to all 
mammals, and especially not to humans. 
Despite great similarities in brain organiza- 
tion, there are also significant differences 
that could have profound functional conse- 
quences’”. Obviously, one such difference is a 
decrease in adult neurogenesis and neuronal 
turnover that seems to have accelerated with 
primate evolution and that may be beneficial 
for human mental capabilities’. Examination 
of apes such as chimpanzees should answer the 
question of whether reduced neurogenesis in 
the adult SVZ is a uniquely human trait or a 
more general trend in hominoids. The present 
work also underscores the importance of con- 
sidering species-specific differences in cellular 
mechanisms, and in their timing and function, 
in seemingly similar structures. 

The general decrease in adult neurogenesis 
with vertebrate evolution is associated with 
a diminished capacity for neuronal replace- 
ment and regeneration”. Reprogramming of 
neural progenitors in the SVZ and redirecting 
of neuronal migration to achieve regeneration 
in injured brain areas are already proving to 
be formidable challenges in rodents. The 
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present results’, demonstrating the lack of 
significant neurogenesis and migration in the 
human SVZ beyond infancy, suggest that such 
strategies might be even more difficult to 
apply in our species. m 
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Many faces of a cancer- 
supporting protein 


The protein Hsp90 is a target of promising anticancer drugs. An analysis of the 
components of Hsp90 complexes in tumours reveals a path that may lead to 
predictive assays of drug sensitivity in cancer patients. 


JOHN F. DARBY & PAUL WORKMAN 


he protein Hsp90 is a molecular 
chaperone — it assists in the cor- 
rect folding of other cellular proteins. 
Many of these Hsp90 ‘client’ proteins are over- 
expressed and/or mutated in cancer and are 
involved in maintaining the cancerous state’. 
Hsp90 inhibition is therefore an attractive 
strategy for simultaneously blocking multi- 
ple abnormal pathways that are crucial for 
many tumour types’. Indeed, early clinical 
trials have confirmed the therapeutic poten- 
tial of this approach. But understanding the 
mechanism that ensures selective targeting of 
tumour cells by Hsp90 inhibitors, and finding 
biomarkers to predict which cancers will be 
most sensitive to such treatment, has proved 
challenging. Writing in Nature Chemical Biol- 
ogy, Moulick et al.* use chemical proteomics 
— acombination of chemical precipitation 
and multi-protein profiling — to shed light 
on these questions. 
There are some 20 Hsp90 inhibitors now 
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in clinical trials. Their effects have been most 
impressive in breast cancers that overexpress a 
highly sensitive Hsp90 client, the HER2 onco- 
protein, but that are resistant to the HER2- 
antibody drug trastuzumab*. They are also 
promising in non-small-cell lung tumours 
that express the mutated oncogenic pro- 
tein EML4-ALK, a similarly sensitive Hsp90 
client’. However, in other cancer types 
Hsp90 inhibition is less effective, despite the 
fact that the oncogenic constituents of such 
cancers are among Hsp90’s clientele. The best 
path to obtaining approval for widespread clin- 
ical use of these inhibitors may therefore be to 
apply them in particular tumour subtypes that 
are driven by highly Hsp90-dependent onco- 
proteins®. To predict an individual patient’s 
responsiveness, it will be crucial to define all 
of Hsp90’s oncoprotein clientele and to under- 
stand the make-up and function of chaperone- 
client complexes, together with the molecular 
networks in which they are involved. 
Large-scale genetic and messenger RNA/ 
protein-profiling efforts have revealed that 


many cellular processes require Hsp90 activity 
under both normal and stress conditions’. 
These screening studies also identified* many 
of the molecules that interact with Hsp90 
but have done little to explain the relevance 
of particular client proteins to traits seen in 
cancer or to help us understand drug sensitiv- 
ity. Moreover, although new Hsp90 clients have 
been found’, such approaches could not easily 
distinguish inhibitor-sensitive chaperone— 
client complexes from the ‘noisy’ background 
of other interactions involving Hsp90. 

A significant study'® published in 2003 
provided insight into why tumour cells are so 
much more sensitive to Hsp90 inhibition than 
healthy cells. It showed that, in cancer cells, 
Hsp90 exists in a highly active, multi-chap- 
erone complex that has much greater affinity 
for inhibitors than does Hsp90 in normal cells. 
However, the role of drug-sensitive Hsp90 
complexes and the identity of their constitu- 
ent clients and co-chaperones are still unclear 
some eight years later, highlighting the need 
for improved identification techniques. 

To discover specific, cancer-associated 
Hsp90 clients and pathways involved in chronic 
myeloid leukaemia (CML), Moulick et al.’ 
have profiled complexes that selectively bind 
to the Hsp90 inhibitor PU-H71, a drug that 
is undergoing clinical trial. Specifically, they 
used chemical precipitation, which selectively 
retrieves only those Hsp90 complexes that are 
‘available’ for inhibition. This is in contrast to 
immunoprecipitation of Hsp90, which recovers 
both drug-binding and non-binding complexes 
(Fig. 1). The authors find that PU-H71 binds to 
a limited fraction (10-20%) of cellular Hsp90, 
suggesting that two distinct Hsp90 subpopula- 
tions are present in CML, with only one being 
available for inhibition by the drug. Across the 
various leukaemic cell lines analysed, the abun- 
dance of PU-H71-bound Hsp90 complexes 
correlated with sensitivity to the inhibitor. 
Indeed, the results of another study" have also 
shown a correlation between inhibitor binding 
and drug sensitivity. 

Moulick and colleagues next performed 
protein profiling using mass spectrometry — 
a powerful proteomic screening approach — 
to identify components of the inhibitor-bound 
Hsp90 complexes. They find that PU-H71 
binds to Hsp90 that is in complex with well- 
known co-chaperones (Hsp40, Hsp70, HOP 
and HIP), but not to Hsp90 complexes lacking 
these co-chaperones. 

With regard to Hsp90 client proteins, 
Bcr-Abl — a product of gene translocation 
and an Hsp90 client that is the driver onco- 
protein in CML — is also preferentially pre- 
sent in PU-H71-bound Hsp90 complexes. By 
contrast, its non-oncogenic precursor, Abl, 
resides in complexes that did not bind this 
inhibitor. It seems, therefore, that Bcr-Abl is 
highly reliant on an active subpopulation of 
Hsp90 that is susceptible to inhibition in CML, 
consistent with the emerging evidence for the 
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Figure 1 | Analysis of Hsp90-containing 
complexes using chemical proteomics. Moulick 
et al.’ find that the Hsp90 inhibitor PU-H71 
selectively precipitates Hsp90 that is in complex 
with well-known co-chaperones and client proteins 
—a subpopulation that makes up roughly 10-20% 
of total cellular Hsp90. The abundance of this 
subpopulation correlates with inhibitor sensitivity. 
By contrast, immunoprecipitation using anti- 
Hsp90 antibodies recovers all Hsp90 complexes. 
Further proteomic and bioinformatic analyses 
reveal that the client proteins in these complexes 
include members of many signalling pathways 
that are deregulated in chronic myeloid leukaemia, 
as well as proteins, such as CARM1, that had not 
previously been associated with Hsp90. 


clinical effectiveness of Hsp90 inhibitors in 
this cancer. 

The authors’ freely available catalogue of 
the proteins found in PU-H71-bound Hsp90 
complexes (see Supplementary Information 
to the paper’) includes members of molecular 
signalling pathways that are associated with 
CML (for example, the PI3K-AKT-mTOR, 
NE-«B, RAF-MAPK, STAT and focal- 
adhesion signalling pathways). Focusing on 
a constituent of one of these pathways, the 
authors show that phosphorylation of the 
protein STATS, which is found in the phos- 
phorylated form in CML, requires Hsp90. 
Furthermore, proteins previously not known 
to be linked to Hsp90 were also found in 
PU-H71-bound Hsp90 complexes. For 
instance, CARM1 (also called PRMT4) — 
an enzyme that regulates gene transcription 
and the levels of which are increased in some 
cancers — is entirely sequestered in these 
Hsp90 complexes, and, as Moulick et al. show, 
is required for the viability of CML cells. This 
finding complements previous proteomic 
studies’ that identified the CARM1-related 
enzyme PRMT5as a novel Hsp90 client. 


NEWS & VIEWS | RESEARCH | 


Chemical proteomics is a powerful way of 
gaining a functional picture of the bewilder- 
ingly complex interactions in which Hsp90 
is involved. This approach could also predict 
which Hsp90 clients are likely to be drug- 
sensitive. Moreover, expansion of the pro- 
teomic analysis to compare a wider range of 
tumour and normal cell types may reveal why 
some malignant cells are more sensitive to 
drug inhibition than others. 

But perhaps the most exciting potential 
use of this new approach is in personalized 
medicine, to predict whether a patient would 
respond to treatment with Hsp90 inhibi- 
tors. Thus, it is conceivable that determining 
the proportion of the Hsp90 population in a 
patient’s cancer cells that is available for bind- 
ing to a tagged inhibitor could predict drug 
sensitivity in the clinic. 

But this is not the end of the story, and, as 
always, questions remain. For example, what 
is the composition and function of the Hsp90 
subpopulation in cancer cells that does not 
bind to PU-H71? What prevents the inhibitor 
from binding to that subpopulation? How do 
Hsp90-regulating co-chaperones and post- 
translational modifications of Hsp90 control 
drug binding to the multi-chaperone complex? 
A puzzling observation made by Moulick et al. 
is that some potent, but chemically distinct, 
Hsp90 inhibitors that target the same site on 
this chaperone have different preferences for 
Hsp90 subpopulations from PU-H71. How 
does this relate to the cellular effects and thera- 
peutic efficacy of the various inhibitors? 

That Hsp90 has a broad range of clients is a 
potential advantage for therapy, but a challenge 
for predicting an individual patient’s response. 
Increased understanding of Hsp90-client 
complexes in various tumour and healthy cells 
should help to pave the way for personalized 
clinical application of Hsp90 inhibitors, and 
could address a fundamental question — what 
makes a protein an Hsp90 client? m 
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Solutions for a cultivated planet 


Jonathan A. Foley', Navin Ramankutty”, Kate A. Brauman', Emily S. Cassidy', James S. Gerber!, Matt Johnston!, 
Nathaniel D. Mueller', Christine O’Connell', Deepak K. Ray’, Paul C. West!, Christian Balzer®, Elena M. Bennett’, 
Stephen R. Carpenter”, Jason Hill’®, Chad Monfreda’, Stephen Polasky"’, Johan Rockstrém’, John Sheehan!, Stefan Siebert'®, 


David Tilman"! & David P. M. Zaks!” 


Increasing population and consumption are placing unprecedented demands on agriculture and natural resources. 
Today, approximately a billion people are chronically malnourished while our agricultural systems are concurrently 
degrading land, water, biodiversity and climate on a global scale. To meet the world’s future food security and 
sustainability needs, food production must grow substantially while, at the same time, agriculture’s environmental 
footprint must shrink dramatically. Here we analyse solutions to this dilemma, showing that tremendous progress 
could be made by halting agricultural expansion, closing ‘yield gaps’ on underperforming lands, increasing cropping 
efficiency, shifting diets and reducing waste. Together, these strategies could double food production while greatly 


reducing the environmental impacts of agriculture. 


ontemporary agriculture faces enormous challenges’ *. Even 

with recent productivity gains, roughly one in seven people lack 

access to food or are chronically malnourished, stemming from 
continued poverty and mounting food prices**. Unfortunately, the situ- 
ation may worsen as food prices experience shocks from market specu- 
lation, bioenergy crop expansion and climatic disturbances’. Even if we 
solve these food access challenges, much more crop production will 
probably be needed to guarantee future food security. Recent studies 
suggest that production would need to roughly double to keep pace with 
projected demands from population growth, dietary changes (especially 
meat consumption), and increasing bioenergy use’ **”, unless there are 
dramatic changes in agricultural consumption patterns. 

Compounding this challenge, agriculture must also address tremend- 
ous environmental concerns. Agriculture is now a dominant force 
behind many environmental threats, including climate change, biodi- 
versity loss and degradation of land and freshwater'®. In fact, agricul- 
ture is a major force driving the environment beyond the “planetary 
boundaries” of ref. 13. 

Looking forward, we face one of the greatest challenges of the twenty- 
first century: meeting society’s growing food needs while simultaneously 
reducing agriculture’s environmental harm. Here we consider several 
promising solutions to this grand challenge. Using new geospatial data 
and models, we evaluate how new approaches to agriculture could bene- 
fit both food production and environmental sustainability. Our analysis 
focuses on the agronomic and environmental aspects of these chal- 
lenges, and leaves a richer discussion of associated social, economic 
and cultural issues to future work. 


The state of global agriculture 

Until recently, the scientific community could not measure, monitor and 
analyse the agriculture-food—environment system’s complex linkages at 
the global scale. Today, however, we have new data that characterize 
worldwide patterns and trends in agriculture and the environment'*"”. 


Agricultural extent 
According to the Food and Agriculture Organization (FAO) of the 
United Nations, croplands cover 1.53 billion hectares (about 12% of 
Earth’s ice-free land), while pastures cover another 3.38 billion hectares 
(about 26% of Earth’s ice-free land) (Supplementary Fig. 1). Altogether, 
agriculture occupies about 38% of Earth’s terrestrial surface—the largest 
use of land on the planet’*'*. These areas comprise the land best suited 
for farming’’: much of the rest is covered by deserts, mountains, tundra, 
cities, ecological reserves and other lands unsuitable for agriculture”’. 
Between 1985 and 2005 the world’s croplands and pastures expanded 
by 154 million hectares (about 3%). But this slow net increase includes 
significant expansion in some areas (the tropics), as well as little change or 
a decrease in others (the temperate zone'*; Supplementary Table 1). The 
result is a net redistribution of agricultural land towards the tropics, with 
implications for food production, food security and the environment. 


Crop yields 

Global crop production has increased substantially in recent decades. 
Studies of common crop groups (including cereals, oilseeds, fruits and 
vegetables) suggest that crop production increased by 47% between 1985 
and 2005 (ref. 18). However, considering all 174 crops tracked by the UN 
FAO and ref. 15, we find global crop production increased by only 28% 
during that time”. 

This 28% gain in production occurred as cropland area increased by 
only 2.4%, suggesting a 25% increase in yield. However, cropland area that 
was harvested increased by about 7% between 1985 and 2005—nearly 
three times the change in cropland area, owing to increased multiple 
cropping, fewer crop failures, and less land left fallow. Accounting for 
the increase in harvested land, average global crop yields increased by only 
20% between 1985 and 2005, substantially less than the often-cited 47% 
production increase for selected crop groups. (Using the same methods as 
for the 20% result, we note that yields increased by 56% between 1965 and 
1985, indicating that yields are now rising less quickly than before.) 
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Aggregate measures of production can mask trends in individual 
crops or crop groups (Supplementary Fig. 2a). For example, cereal crops 
decreased in harvested area by 3.6% between 1985 and 2005, yet their 
total production increased by 29%, reflecting a 34% increase in yields per 
hectare. Oil crops, on the other hand, showed large increases in both 
harvested area (43%) and yield (57%), resulting in a 125% increase in 
total production’’. While most crops increased production between 
1985 and 2005, fodder crops did not: on average, they saw an 18% 
production drop as a 26% loss in harvested area overrode an 11% 
increase in yields. 

Using geospatial data’*, we can examine how yield patterns have 
changed for key commodities (for example, maize in Supplementary 
Fig. 2b). These geographic patterns show us where productivity gains 
have been successful, where they have not, and where further oppor- 
tunities for improvement lie. 


Crop use and allocation 
The allocation of crops to nonfood uses, including animal feed, seed, 
bioenergy and other industrial products, affects the amount of food 
available to the world. Globally, only 62% of crop production (on a mass 
basis) is allocated to human food, versus 35% to animal feed (which 
produces human food indirectly, and much less efficiently, as meat and 
dairy products) and 3% for bioenergy, seed and other industrial products. 
A striking disparity exists between regions that primarily grow crops 
for direct human consumption and those that produce crops for other 
uses (Fig. 1). North America and Europe devote only about 40% of their 
croplands to direct food production, whereas Africa and Asia allocate 
typically over 80% of their cropland to food crops. Extremes range from 
the Upper Midwestern USA (less than 25%) to South Asia (over 90%). 
As we face the twin challenges of feeding a growing world while 
charting a more environmentally sustainable path, the amount of land 
(and other resources) devoted to animal-based agriculture merits critical 
evaluation. For example, adding croplands devoted to animal feed 
(about 350 million hectares) to pasture and grazing lands (3.38 billion 
hectares), we find the land devoted to raising animals totals 3.73 billion 
hectares—an astonishing ~75% of the world’s agricultural land. We 
further note that meat and dairy production can either add to or subtract 
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Figure 1 | Allocation of cropland area to different uses in 2000. Here we 
show the fraction of the world’s total cropland that is dedicated to growing food 
crops (crops that are directly consumed by people) versus all other crop uses, 
including animal feed, fibre, bioenergy crops and other products. Averaged 
across the globe, 62% of total crop production (on a mass basis) is allocated to 
human food, 35% for animal feed (which produces human food indirectly, and 
less efficiently, as meat and dairy products) and 3% for bioenergy crops, seed, 
and other industrial products. There are striking disparities between regions 
that primarily grow crops for human consumption (such as Africa, South Asia, 
East Asia), and those that mainly produce crops for other uses (such as North 
America, Europe, Australia). Food production and allocation data were 
obtained from FAOSTAT“, and were then applied to the spatial cropland maps 
of refs 14 and 15. All data are for a seven-year period centred on 2000. 
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from the world’s food supply. Grazing systems, especially on pastures 
unsuitable for other food production, and mixed crop-livestock systems 
can add calories and protein to the world and improve economic con- 
ditions and food security in many regions. However, using highly pro- 
ductive croplands to produce animal feed, no matter how efficiently, 
represents a net drain on the world’s potential food supply. 


Global environmental impacts of agriculture 


The environmental impacts of agriculture include those caused by 
expansion (when croplands and pastures extend into new areas, repla- 
cing natural ecosystems) and those caused by intensification (when 
existing lands are managed to be more productive, often through the 
use of irrigation, fertilizers, biocides and mechanization). Below, we use 
new data and models'”*'” to examine both. 

Agricultural expansion has had tremendous impacts on habitats, bio- 
diversity, carbon storage and soil conditions'®''’***. In fact, worldwide 
agriculture has already cleared or converted 70% of the grassland, 50% of 
the savanna, 45% of the temperate deciduous forest, and 27% of the 
tropical forest biome!”». 

Today, agriculture is mainly expanding in the tropics, where it is 
estimated that about 80% of new croplands are replacing forests”*. This 
expansion is worrisome, given that tropical forests are rich reservoirs of 
biodiversity and key ecosystem services”’. Clearing tropical forests is also 
a major source of greenhouse gas emissions and is estimated to release 
around 1.1 X 10’° grams of carbon per year, or about 12% of total anthro- 
pogenic CO) emissions’*. Slowing or halting expansion of agriculture in 
the tropics—which accounts for 98% of total CO emissions from land 
clearing’’—will reduce carbon emissions as well as losses of biodiversity 
and ecosystem services”. 

Agricultural intensification has dramatically increased in recent decades, 
outstripping rates of agricultural expansion, and has been responsible for 
most of the yield increases of the past few decades. In the past 50 years, the 
world’s irrigated cropland area roughly doubled’*”**', while global fertilizer 
use increased by 500% (over 800% for nitrogen alone)'****. Intensification 
has also caused water degradation, increased energy use, and widespread 
pollution®?***°, 

Of particular concern is that some 70% of global freshwater with- 
drawals (80-90% of consumptive uses) are devoted to irrigation**”’. 
Furthermore, rain-fed agriculture is the world’s largest user of water 
In addition, fertilizer use, manure application, and leguminous crops 
(which fix nitrogen in the soil) have dramatically disrupted global nitro- 
gen and phosphorus cycles®’, with associated impacts on water quality, 
aquatic ecosystems and marine fisheries***’. 

Both agricultural expansion and intensification are also major con- 
tributors to climate change. Agriculture is responsible for 30-35% of 
global greenhouse gas emissions, largely from tropical deforestation, 
methane emissions from livestock and rice cultivation, and nitrous 
oxide emissions from fertilized soils”**. 

We can draw important conclusions from these trends. First, the 
expansion of agriculture in the tropics is reducing biodiversity, increas- 
ing greenhouse gas emissions, and depleting critical ecosystem services. 
Yet this expansion has done relatively little to add to global food sup- 
plies; most production gains have been achieved through intensification. 
Second, the costs and benefits of agricultural intensification vary greatly, 
often depending on geographic conditions and agronomic practices. 
This suggests that some forms (and locations) of intensification are 
better than others at balancing food production and environmental 
protection'!*”, 
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Enhancing food production and sustainability 


Until recently, most agricultural paradigms have focused on improving 
production, often to the detriment of the environment'®”’*’. Likewise, 
many environmental conservation strategies have not sought to 
improve food production. However, to achieve global food security 
and environmental sustainability, agricultural systems must be trans- 
formed to address both challenges (Fig. 2). 
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Figure 2 | Meeting goals for food security and environmental sustainability 
by 2050. Here we qualitatively illustrate a subset of the goals agriculture must 
meet in the coming decades. At the top, we outline four key food security goals: 
increasing total agricultural production, increasing the supply of food 
(recognizing that agricultural yields are not always equivalent to food), 
improving the distribution of and access to food, and increasing the resilience of 
the whole food system. At the bottom, we illustrate four key environmental goals 
agriculture must also meet: reducing greenhouse gas emissions from agriculture 
and land use, reducing biodiversity loss, phasing out unsustainable water 
withdrawals, and curtailing air and water pollution from agriculture. Panel 

a sketches out a qualitative assessment of how current agricultural systems may 
be measured against these criteria compared to goals set for 2050. Panel 

b illustrates a hypothetical situation in which we meet all of these goals by 2050. 


First, the transformation of agriculture must deliver sufficient food 
and nutrition to the world. To meet the projected demands of popu- 
lation growth and increasing consumption, we must roughly double 
food supplies in the next few decades'*. We must also improve distri- 
bution and access, which will require further changes in the food system. 

The transformation of agriculture should also (1) cut greenhouse gas 
emissions from land use and farming by at least 80% (ref. 48); (2) reduce 
biodiversity and habitat losses; (3) reduce unsustainable water withdra- 
wals, especially where water has competing demands; and (4) phase out 
water pollution from agricultural chemicals. Other environmental issues 
must also be addressed, but these four undergird the relationship 
between agriculture and the environment and should be addressed as 
necessary first steps. 

An influential series of recent reports has suggested possible solutions 
to our interwoven food security and environmental challenges'”®. 
Below, we consider the potential strengths and weaknesses of four pro- 
posed strategies. 


Stop expanding agriculture 
The expansion of agriculture into sensitive ecosystems has far-reaching 
effects on biodiversity, carbon storage and important environmental 
services'®''*°, This is particularly true when tropical forests are cleared 
for agriculture’””*’°, estimated to cause 5-10 million hectares of forest 
loss annually'**'. Slowing (and, ultimately, ceasing) the expansion of 
agriculture, particularly into tropical forests, will be an important first 
step in shifting agriculture onto a more sustainable path. 

But will ending the expansion of agriculture negatively affect food 
supplies? Our analysis suggests that the food production benefits of 
tropical deforestation are often limited, especially compared to the 
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environmental damages accrued. First of all, many regions cleared for 
agriculture in the tropics have low yields compared with their temperate 
counterparts. The authors of ref. 21 considered crop production and 
carbon emissions resulting from deforestation and demonstrated that 
the balance of production gains to carbon losses was often poor in 
tropical landscapes (Supplementary Fig. 3). Regions of tropical agricul- 
ture that do have high yields—particularly areas of sugarcane, oil palm 
and soybeans—typically do not contribute much to the world’s total 
calorie or protein supplies, especially when crops are used for feed or 
biofuels. Nevertheless, such crops do provide income, and thereby con- 
tribute to poverty alleviation and food security to some sectors of the 
population. 

Although ceasing the expansion of agriculture into tropical forests 
might have a negative—but probably small—impact on global crop pro- 
duction, losses can be offset elsewhere in the food system. Agricultural 
production potential that is ‘lost’ by halting deforestation could be offset 
by reducing losses of productive farmland and improving yields on exist- 
ing croplands. Though the ‘indirect land use’ effects of biofuel production 
are thought to increase pressure on tropical forests”, it may also be true 
that increasing food production in non-tropical zones might reduce 
pressures on tropical forests. 

Economic drivers hold great sway over deforestation’>*°. Ecologically 
friendly economic incentives could play an important part in slowing 
forest loss: the proposed Reducing Emissions from Deforestation and 
Degradation (REDD) programme, market certification, and ecotourism 
all provide opportunities to benefit economically from forest protection”. 


Close yield gaps 

Increasing food production without agricultural expansion implies that 
we must increase production on our existing agricultural lands. The best 
places to improve crop yields may be on underperforming landscapes, 
where yields are currently below average. 

Recent analyses*”** have found large yield variations across the world, 
even among regions with similar growing conditions, suggesting the 
existence of ‘yield gaps’ (Supplementary Fig. 4a). Here we define a yield 
gap as the difference between crop yields observed at any given location 
and the crop’s potential yield at the same location given current agri- 
cultural practices and technologies. 

Much of the world experiences yield gaps (Supplementary Fig. 4a) 
where productivity may be limited by management. There are significant 
opportunities to increase yields across many parts of Africa, Latin 
America and Eastern Europe, where nutrient and water limitations seem 
to be strongest (Supplementary Fig. 4b). Better deployment of existing 
crop varieties with improved management should be able to close many 
yield gaps”, while continued improvements in crop genetics will probably 
increase potential yields into the future. 

Closing yield gaps could substantially increase global food supplies. 
Our analysis shows that bringing yields to within 95% of their potential 
for 16 important food and feed crops could add 2.3 billion tonnes 
(5 X 10'° kilocalories) of new production, a 58% increase (Fig. 3). 
Even if yields for these 16 crops were brought up to only 75% of their 
potential, global production would increase by 1.1 billion tonnes 
(2.8 X 10° kilocalories), a 28% increase. Additional gains in productivity, 
focused on increasing the maximum yield of key crops, are likely to be 
driven by genetic improvements™*'. Significant opportunities may also 
exist to improve yield and the resilience of cropping systems by improving 
‘orphan crops’ (such crops have not been genetically improved or had 
much investment) and preserving crop diversity, which have received 
relatively little investment to date. 

To close global yield gaps, the interwoven challenges of production and 
environment must again be addressed: conventional approaches to inten- 
sive agriculture, especially the unbridled use of irrigation and fertilizers, 
have been major causes of environmental degradation. Closing yield gaps 
without environmental degradation will require new approaches, includ- 
ing reforming conventional agriculture and adopting lessons from organic 
systems and precision agriculture. In addition, closing yield gaps will 
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Figure 3 | Closing global yield gaps. Many agricultural lands do not attain 
their full yield potential. The figure shows the new calories that would be made 
available to the world from closing the yield gaps for 16 major crops: barley, 
cassava, groundnut, maize, millet, potato, oil palm, rapeseed, rice, rye, sorghum, 
soybean, sugarbeet, sugarcane, sunflower and wheat. This analysis shows that 
bringing the world’s yields to within 95% of their potential for these 16 
important food and feed crops could add 2.3 billion tonnes 

(5 X 10’ kilocalories) of new crop production, representing a 58% increase. 
These improvements in yield can be largely accomplished by improving the 
nutrient and water supplies to crops in low-yielding regions; further 
enhancement of global food production could be achieved through improved 
crop genetics. The methods used to calculate yield gaps and limiting factors are 
described in the Supplementary Information. 


require overcoming considerable economic and social challenges, includ- 
ing the distribution of agricultural inputs and seed varieties and improving 
market infrastructure. 


Increase agricultural resource efficiency 

Moving forward, we must find more sustainable pathways for intensi- 
fication that increase crop production while greatly reducing unsustain- 
able uses of water, nutrients and agricultural chemicals. 

Irrigation is currently responsible for water withdrawals of about 
2,800 km’ per year from groundwater, lakes and rivers. Irrigation is used 
on about 24% of croplands and is responsible for delivering 34% of 
agricultural production”. In fact, without irrigation, global cereal pro- 
duction would decrease by an estimated 20% (ref. 17), so more land 
would be required to produce the same amount of food. 

However, the benefits and impacts of irrigation are not evenly dis- 
tributed. Water needed for crop production varies greatly across the 
world (Supplementary Fig. 5). We find that, when irrigated, 16 staple 
crops use an average of 0.3 litres per kilocalorie (not including water 
losses). However, these water requirements are skewed: 80% of irrigated 
crops require less than 0.4 litres per kilocalorie, while the remaining 20% 
require 0.7 litres per kilocalorie or more. 

Where water is scarce, good water and land management practices 
can increase irrigation efficiency. For example, curtailing off-field evap- 
orative losses from water storage and transport and reducing on-field 
losses through mulching and reduced tillage will increase the value of 
irrigation water. 

Chemical fertilizers, manure and leguminous crops have also been key 
to agricultural intensification. However, they have also led to widespread 
nutrient pollution and the degradation of lakes, rivers and coastal oceans. 
In addition, the release of nitrous oxide from fertilized fields contributes 
to climate change. Excess nutrients also incur energy costs associated 
with converting atmospheric nitrogen and mining phosphorus”. 

Even though excess nutrients cause environmental problems in some 
parts of the world, insufficient nutrients are a major agronomic problem 
in others. Many yield gaps are mainly due to insufficient nutrient avail- 
ability (Supplementary Fig. 4b). This ‘Goldilocks’ problem of nutrients 
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(that is, there are many regions with too much or too little fertilizer but 
few that are ‘just right’) is one of the key issues facing agriculture today. 

Building on recent analyses of crop production, fertilizer use and 
nutrient cycling’*****°, we examine patterns of agricultural nitrogen 
and phosphorus balance across the world. Specifically, we show areas 
of excess nutrients resulting from imbalances between nutrient inputs 
(fertilizers, legumes and atmospheric deposition), harvest removal and 
environmental losses (Supplementary Fig. 6). We further analyse the 
efficiency of nutrient use by comparing applied nutrients to yield for 16 
major crops (Supplementary Fig. 6c, d). 

Our analysis reveals ‘hotspots’ of low nutrient use efficiency (Sup- 
plementary Fig. 6c, d) and large volumes of excess nutrients (Sup- 
plementary Fig. 6e, f). Nutrient excesses are especially large in 
China®’, Northern India, the USA and Western Europe. We also find 
that only 10% of the world’s croplands account for 32% of the global 
nitrogen surplus and 40% of the phosphorus surplus. Targeted policy 
and management in these regions could improve the balance between 
yields and the environment. Such actions include reducing excessive 
fertilizer use, improving manure management, and capturing excess 
nutrients through recycling, wetland restoration and other practices. 

Taken together, these results illustrate many opportunities to improve 
the water and nutrient efficiency of agriculture without reducing food 
production. Targeting particular ‘hotspots’ of low efficiency, measured 
as the disproportionate use of water and nutrient inputs relative to 
production, could significantly reduce the environmental problems of 
intensive agriculture. Furthermore, agroecological innovations in crop 
and soil management" show great promise for improving the resource 
efficiency of agriculture, maintaining the benefits of intensive agricul- 
ture while greatly reducing harm to the environment. 


Increase food delivery by shifting diets and reducing waste 
While improving crop yields and reducing agriculture’s environmental 
impacts will be instrumental in meeting future needs, it is also important 
to remember that more food can be delivered by changing our agricul- 
tural and dietary preferences. Simply put, we can increase food avail- 
ability (in terms of calories, protein and critical nutrients) by shifting 
crop production away from livestock feed, bioenergy crops and other 
non-food applications. 

In Supplementary Fig. 7, we compare intrinsic food production (calories 
available if all crops were consumed by humans) and delivered food pro- 
duction (calories available based on today’s allocation of crops to food, 
animal feed, and other products, assuming standard conversion factors) for 
16 staple crops. By subtracting these two figures, we estimate the potential 
to increase food supplies by closing the “diet gap’: shifting 16 major crops to 
100% human food could add over a billion tonnes to global food produc- 
tion (a 28% increase), or the equivalent of 3 X 10’° food kilocalories (a 49% 
increase) (Fig. 4). 

Of course, the current allocation of crops has many economic and 
social benefits, and this mixed use is not likely to change completely. But 
even small changes in diet (for example, shifting grain-fed beef con- 
sumption to poultry, pork or pasture-fed beef) and bioenergy policy (for 
example, not using food crops as biofuel feedstocks) could enhance food 
availability and reduce the environmental impacts of agriculture. 

A large volume of food is never consumed but is instead discarded, 
degraded or consumed by pests along the supply chain. A recent FAO 
study® suggests that about one-third of food is never consumed; others” 
have suggested that as much as half of all food grown is lost; and some 
perishable commodities have post-harvest losses of up to 100% (ref. 70). 
Developing countries lose more than 40% of food post-harvest or during 
processing because of storage and transport conditions. Industrialized 
countries have lower producer losses, but at the retail or consumer level 
more than 40% of food may be wasted®. 

In short, reducing food waste and rethinking dietary, bioenergy and 
other agricultural choices could substantially improve the delivery of 
calories and nutrition with no accompanying environmental harm. 
While wholesale conversions of the human diet and the elimination of 
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Figure 4 | Closing the diet gap. We estimate the potential to increase food 
supplies by closing the ‘diet gap’: shifting 16 major crops to 100% human food 
and away from the current mix of uses (see Fig. 1) could add over a billion 
tonnes to global food production (a 28% increase for those 16 crops), the 
equivalent of ~3 X 10’ kilocalories more food to the global diet (a 49% 
increase in food calories delivered). 


food waste are not realistic goals, even incremental steps could be extre- 
mely beneficial. Furthermore, targeted efforts—such as reducing waste 
in our most resource-intensive foods, especially meat and dairy—could 
be designed for optimal impact. 


Searching for practical solutions 

Today, humans are farming more of the planet than ever, with higher 
resource intensity and staggering environmental impacts, while divert- 
ing an increasing fraction of crops to animals, biofuels and other non- 
food uses. Meanwhile, almost a billion people are chronically hungry. 
This must not continue: the requirements of current and future genera- 
tions demand that we transform agriculture to meet the twin challenges 
of food security and environmental sustainability. 

Our analysis demonstrates that four core strategies can—in principle— 
meet future food production needs and environmental challenges if 
deployed simultaneously. Adding them together, they increase global food 
availability by 100-180%, meeting projected demands while lowering 
greenhouse gas emissions, biodiversity losses, water use and water pol- 
lution. However, all four strategies are needed to meet our global food 
production and environmental goals; no single strategy is sufficient. 

We have described general approaches to solving global agricultural 
challenges, but much work remains to translate them into action. 
Specific land use, agricultural and food system tactics must be developed 
and deployed. Fortunately, many such tactics already exist, including 
precision agriculture, drip irrigation, organic soil remedies, buffer strips 
and wetland restoration, new crop varieties that reduce needs for water 
and fertilizer, perennial grains and tree-cropping systems, and paying 
farmers for environmental services. However, deploying these tactics 
effectively around the world requires numerous economic and govern- 
ance challenges to be overcome. For example, reforming global trade 
policies, including eliminating price-distorting subsidies and tariffs, will 
be vital to achieving our strategies. 

In developing improved land use and agricultural practices, we 
recommend following these guidelines: 

(1) Solutions should focus on critical biophysical and economic ‘lever- 
age points’ in agricultural systems, where major improvements in food 
production or environmental performance may be achieved with the 
least effort and cost. 

(2) New practices must also increase the resilience of the food system. 
High-efficiency, industrialized agriculture has many benefits, but it is 
vulnerable to disasters”', including climatic disturbances, new diseases 
and economic calamities. 
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(3) Agricultural activities have many costs and benefits, but methods of 
evaluating the trade-offs are still poorly developed”’. We need better 
data and decision support tools to improve management decisions”’, 
productivity and environmental stewardship. 

(4) The search for agricultural solutions should remain technology- 
neutral. There are multiple paths to improving the production, food 
security and environmental performance of agriculture, and we should 
not be locked into a single approach a priori, whether it be conventional 
agriculture, genetic modification or organic farming. 

The challenges facing agriculture today are unlike anything we have 
experienced before, and they require revolutionary approaches to solv- 
ing food production and sustainability problems. In short, new agricul- 
tural systems must deliver more human value, to those who need it most, 
with the least environmental harm. 
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Changes in gene expression are thought to underlie many of the phenotypic differences between species. However, 
large-scale analyses of gene expression evolution were until recently prevented by technological limitations. Here we 
report the sequencing of polyadenylated RNA from six organs across ten species that represent all major mammalian 
lineages (placentals, marsupials and monotremes) and birds (the evolutionary outgroup), with the goal of understanding 
the dynamics of mammalian transcriptome evolution. We show that the rate of gene expression evolution varies among 
organs, lineages and chromosomes, owing to differences in selective pressures: transcriptome change was slow in 
nervous tissues and rapid in testes, slower in rodents than in apes and monotremes, and rapid for the X chromosome 
right after its formation. Although gene expression evolution in mammals was strongly shaped by purifying selection, we 
identify numerous potentially selectively driven expression switches, which occurred at different rates across lineages 
and tissues and which probably contributed to the specific organ biology of various mammals. 


Shared mammalian traits include lactation, hair and relatively large 
brains with unique structures’. In addition to these traits, individual 
lineages have evolved distinct anatomical, physiological and beha- 
vioural characteristics relating to differences in reproduction, life span, 
cognitive abilities and disease susceptibility. The molecular changes 
underlying these phenotypic shifts and the associated selective pres- 
sures have begun to be investigated using available mammalian 
genomes’, the number of which is rapidly increasing. However, 
although genome analyses may uncover protein-coding changes that 
potentially underlie phenotypic alterations, regulatory mutations 
affecting gene expression probably explain many or even most pheno- 
typic differences between species’. 

Until recently, comparisons of mammalian transcriptomes were 
essentially restricted to closely related primates** or mice’, although 
human-mouse comparisons using microarrays were also attempted”. 
Nevertheless, microarrays require hybridization to species-specific 
probes, making between-species comparisons of transcript abund- 
ance difficult®. The development of RNA sequencing (RNA-seq) pro- 
tocols now allows for accurate and sensitive assessments of expression 
levels'®. The power of RNA-seq for transcriptome assessment was 
recently demonstrated for human individuals''’” and closely related 
primates’*"*. 


RNA-seq and genome reannotation 


To study mammalian transcriptome evolution at high resolution, we 
generated RNA-seq data (~3.2 billion lumina Genome Analyser IIx 
reads of 76 base pairs) for the polyadenylated RNA fraction of brain 
(cerebral cortex or whole brain without cerebellum), cerebellum, 
heart, kidney, liver and testis (usually from one male and one female 
per somatic tissue, and two males for testis) from nine mammalian 
species (Supplementary Tables 1 and 2, Methods and Supplementary 


Note): placental mammals (great apes, including humans; rhesus 
macaque; and mouse), marsupials (grey short-tailed opossum) and 
monotremes (platypus). Corresponding data (~0.3 billion reads) 
were generated for a bird (red jungle fowl, a non-domesticated 
chicken) and used as an evolutionary outgroup. 

We refined existing Ensembl’* genome annotations by performing 
an initial read mapping to detect transcribed regions and splice junc- 
tions (Methods and Supplementary Note), which resulted in modified 
boundaries for ~31,000-44,500 exons and the addition of 20,000- 
34,500 new exons and 66,000-125,000 new splice junctions to known 
protein-coding genes (Supplementary Note Tables 4 and 5). We also 
searched de novo for multi-exonic transcribed loci; our results vali- 
dated most Ensembl-annotated protein-coding genes, pseudogenes 
and long non-coding RNA genes (Supplementary Note Table 11), 
but we also detected thousands of multi-exonic transcribed loci 
(possibly representing protein-coding or non-coding RNA genes) in 
previously unannotated regions (Supplementary Note Table 10). 

Newly detected exons are transcribed at lower levels and are signifi- 
cantly less conserved, at the sequence level, than Ensembl-annotated 
exons (two-tailed P< 10 8, Mann-Whitney U-test; Supplementary 
Fig. 1). However, the sequence conservation level is higher for new 
exons than for flanking introns, with visible peaks around splice sites, 
indicating that many of these exon sequences are preserved by puri- 
fying selection’®. 

Depending on the species, 11-30% of the total genomic length is 
covered by unambiguously mapped RNA-seq reads (Table 1). Much 
of the covered length is explained by retained introns, but substantial 
coverage is also found outside annotated regions (Table 1). Our data 
suggest that large proportions (>34-61%) of amniote (that is, 
mammal and bird) genomes are transcribed, consistent with previous 
work”, 
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Table 1 | Assessment of the transcribed fraction of amniote genomes based on RNA-seq 


Species 


Ensembl genes (Mb) 


Intergenic multi-exonic transcribed loci (Mb) 


Other int 


ergenic Total covered 


Total transcribed& No. reads]|| 


islands (Mb) (Mb) (Mb) (10°) 
Exons* Introns+ Exons* Introns+ 

Human 80 (84%) 410 (29%) 3.5 (100%) 12 (15%) 121 627 (20%) 1,685 (54%) 255 
Chimp 58 (93%) 404 (36%) 8.4 (100%) 30 (20%) 185 685 (19%) 1,508 (43%) 199 
Gorilla 53 (90%) 297 (32%) 7.6 (100%) 30 (19%) 179 567 (19%) 1,314 (43%) 163 
Orangutan 43 (89%) 260 (27%) 6.2 (100%) 22 (20%) 144 475 (14%) 1,259 (37%) 131 
Macaque 52 (92%) 365 (35%) 9.0 (100%) 35 (22%) 202 663 (21%) 1,455 (47%) 156 
Mouse 71 (88%) 433 (42%) 5.1 (100%) 8 (21%) 127 654 (24%) 1,311 (48%) 278 
Opossum 50 (92%) 263 (23%) 6.7 (100%) 23 (16%) 156 499 (14%) 1,499 (42%) 158 
Platypus 31 (89%) 72 (16%) 10.0 (100%) 6 (13%) 107 234 (11%) 712 (34%) 137 
Chicken 40 (94%) 177 (38%) 6.0 (100%) 7 (26%) 95 336 (30%) 676 (61%) 146 
* Exonic length covered by unambiguous RNA-seq reads (percentage of the total exonic length of expressed genes). 
+ Intronic length covered by unambiguous RNA-seq reads (percentage of the total intronic length of expressed genes). 


£Total length covered by unambiguous RNA-seq reads (percentage of the total genomic length). 


§ Estimation of the total transcribed length in our data set (percentage of the total genomic length). This estimation includes all exonic and intergenic regions covered by unambiguous RNA-seq reads, as well as the 
intronic length of intergenic multi-exonic transcribed loci and of Ensembl-annotated genes detected as expressed in our samples. 


|| Number of unambiguously mapping reads that were used to compute the read coverage statistics. 
Mb, megabase. 


On the basis of the refined genome annotations, we remapped our 
RNA-seq reads and resolved read mapping ambiguities (Methods). In 
this Article, we focus on comparative analyses of expression levels of 
protein-coding genes. For comparisons among all ten amniote species, 
we used a set of 5,636 one-to-one (1:1) orthologues (Methods). A 
corresponding set of 13,277 1:1 orthologues was used for the six 
primates. Expression values were normalized to render the data com- 
parable across species (Methods). 


Mammalian gene expression phylogenies 

To obtain an initial overview of gene expression patterns, we per- 
formed a principal-component analysis, which clearly separates the 
data according to tissue (only the neural tissues do not perfectly 
separate), although a substantial part of the variance is also explained 
by differences among lineages (Fig. 1a). 

To reconstruct global evolutionary trends in more detail, we built 
expression distance matrices for each tissue (Methods) and recon- 
structed gene expression trees (Fig. 1b and Supplementary Figs 2 
and 3). The trees are highly consistent with the known mammalian 
phylogeny: they correctly resolve the three major mammalian lineages 
(placentals, or eutherians; marsupials; and monotremes), separate the 
two eutherian lineages (primates and rodents) and group humans and 
the other great apes to the exclusion of the macaque (an Old World 
monkey). This suggests that regulatory changes accumulate over 
evolutionary time, such that closely related species have more similar 
expression levels. Our results are remarkable given the within-species 
variation (including sometimes substantial sex-biased gene expression; 
Supplementary Note, Supplementary Table 3 and Supplementary 
Fig. 4) and the fact that age, feeding status and other characteristics 
of the individuals could not be perfectly matched between species, for 
practical and biological reasons. Thus, evolutionary signals inherent in 
our data, which may reflect changes in cellular gene expression levels or 
changes in the cellular composition of organs between species 
(Supplementary Note), outweigh the gene expression variability result- 
ing from sampling differences. 

However, branching patterns within the great ape clade do not 
always reflect the known phylogeny (Supplementary Fig. 2), in par- 
ticular for human, chimpanzee-bonobo and gorilla, which diverged 
only ~5-7 million years ago'*. Bootstrapping analyses show that the 
branching order of these four species cannot be robustly established 
for the somatic tissues on the basis of the 5,636 amniote 1:1 ortholo- 
gues (bootstrap values <0.9; Supplementary Fig. 2). 

To resolve great ape gene expression relationships, we built 
expression phylogenies based on the 13,277 primate 1:1 orthologues 
(Supplementary Fig. 5), which robustly resolve the great ape clade 
(that is, bootstrap values >0.9, except for brain) and reveal surprising 
patterns. In half of the expression trees (testis, heart and brain), 
humans and gorillas group together, whereas chimpanzees and bonobos 
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Figure 1 | Global patterns of gene expression differences among mammals. 
a, Factorial map of the principal-component analysis of messenger RNA 
expression levels. The proportion of the variance explained by the principal 
components is indicated in parentheses. b, Mammalian gene expression 
phylogenies. Neighbour-joining trees based on pairwise distance matrices 

(1 — p, Spearman’s correlation coefficient) for cerebellum and testis (see 
Supplementary Fig. 2 for all six organs). Bootstrap values (5,636 1:1 orthologous 
amniote genes were randomly sampled with replacement 1,000 times) are 
indicated by circles: white, >0.9; yellow, =0.9. Species colour codes: platypus, 
light blue; opossum, dark blue; eutherians (mice and primates), black. 
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fall outside of this clade. These two species always group together, as 
expected given their recent divergence’’, although chimpanzees and 
bonobos are not always monophyletic. The testis tree groups human 
with gorilla (bootstrap value of 1; Supplementary Fig. 5), consistent with 
the evolution of male physiology and mating patterns among the 
African apes: the highly promiscuous chimpanzees and bonobos 
evolved larger testicles relative to body size and higher sperm production 
rates than did the less promiscuous humans and gorillas*®. Whereas the 
kidney and cerebellum trees are consistent with the known species 
phylogeny, the liver tree has an interesting pattern: humans fall outside 
a clade comprising the other great apes, within which gorillas and 
orangutans group together. Given the role of the liver in metabolic 
control and detoxification, these patterns may reflect dietary variation 
among the great apes, although they may also reflect feeding status 
patterns of the individuals sampled. 


Rates of expression change in lineages and organs 


The branch lengths from the common ancestor of all species to the 
tips of the tree are remarkably similar (Supplementary Fig. 2), sug- 
gesting that gene expression evolution has proceeded at comparable 
rates in different mammalian lineages. However, the branches leading 
to mouse are significantly shorter in several tissues, particularly in 
comparison with those leading to great apes and monotremes 
(Bonferroni-corrected two-tailed P< 0.05 in four or three of the six 
tissues, respectively; randomization test), in spite of the high rodent 
DNA mutation rates”' (Supplementary Fig. 6; see Supplementary Fig. 7 
for results that remove within-species variation differences). This is 
consistent with the strong purifying selection affecting the rodent 
lineage because of large long-term effective population sizes**”’. Our 
observations, confirmed by another phylogenetic method (Sup- 
plementary Note), agree with previous inferences from gene expres- 
sion studies™ and protein sequence evolution” and lend support to 
previous models of gene expression evolution®, which assign a dom- 
inant role to purifying selection. 

The total branch lengths of the trees vary widely among tissues 
(Fig. 2a). The two neural tissues evolve significantly more slowly than 
the other organs in both amniotes as a whole and primates (Bonferroni- 
corrected two-tailed P< 0.001; randomization test), suggesting that 
they may have experienced stronger purifying selection and/or less 
positive selection than other tissues during mammalian evolution. 
This observation is remarkable in view of the substantial changes in 
the size, structure and cellular composition of the brain that occurred 
during mammalian evolution”, but is consistent with previous find- 
ings®***” which suggested that nervous tissues may have more fine- 
tuned expression networks than other organs. 

Liver, heart and kidney show similar rates of gene expression 
change in amniotes (randomization test not significant, P> 0.1), 
whereas in primates kidney evolves significantly more slowly than 
heart and liver (P< 0.05; Fig. 2a). Notably, the testis, previously 
shown to evolve rapidly both at the phenotypic”? and molecular 
levels®**, potentially owing to positive selection associated with sperm 
competition and other sex-related evolutionary forces”, is the most 
rapidly evolving tissue for both data sets (P< 0.001). 

Pairwise species comparisons confirm that gene expression diver- 
gence overall increases with evolutionary time (Fig. 2b), consistent 
with the expression phylogeny results (see above). However, for most 
tissues, expression levels are approximately as similar between human 
and chicken as they are between human and platypus, although the 
bird lineage diverged ~110 million years before the separation of 
monotremes and therian mammals (that is, eutherians and marsu- 
pials). This suggests that the conservation of core organ functions 
restricts transcriptome divergence. 


Gene expression evolution on the X chromosome 


Next we investigated the rate of gene expression change on the dif- 
ferent types of chromosome. Sex chromosomes of therians are derived 
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Figure 2 | Expression divergence rates across tissues and chromosomes. 

a, Comparisons of total branch lengths of expression trees among the six tissues 
(br, brain; cb, cerebellum; ht, heart; kd, kidney; lv, liver; ts, testis), for the all- 
amniote and primate data sets. Errors, 95% confidence intervals based on 
bootstrapping analysis (1,000 replicates, with one individual per species 
sampled in each replicate). b, Spearman’s correlations between humans and the 
other species. Coloured envelopes show ranges of values obtained in 100 
bootstrap replicates. c, Expression evolution rates on therian X chromosome 
versus autosomes. Rectangles reflect median branch lengths (1,000 bootstrap 
replicates) in X-chromosome expression trees (102 1:1 orthologues located in 
the X-chromosome conserved region”; red) relative to those in autosome trees 
(5,494 autosomal orthologues; white). P values are based on bootstrap 
replicates: an asterisk indicates two-tailed P< 0.05 (that is, branch longer in 
X-chromosome tree in more than 97.5% of replicates) and a plus sign indicates 
P<0O.1. 


from the same ancestral autosomes***', whereas the multiple X and Y 
chromosomes of the monotremes are distinct and partly homologous 
to the sex chromosomes of birds*”*’. To test whether gene expression 
evolution on the therian X chromosome accelerated after sex chro- 
mosome differentiation*’, we compared rates of expression change for 
genes that are X-linked in both eutherians and marsupials (that is, 
genes in the X-chromosome conserved region™*) and autosomal 
genes, on the basis of branch lengths in expression trees reconstructed 
for the two types of chromosome (Fig. 2c). 

This analysis suggests that gene expression evolution was faster on 
the X chromosome than on autosomes in the common ancestor of 
therian mammals (two-tailed P< 0.05 for brain, cerebellum and 
heart; P< 0.1 for kidney, liver and testis; randomization test), which 
corresponds to the time when the original proto-XY chromosomes 
evolved into sex chromosomes, and remained accelerated in the common 
eutherian ancestor (two-tailed P<0.05 for brain, cerebellum and 
kidney; P< 0.1 for heart, liver and testis). By contrast, the rate of 
X-chromosome expression evolution was similar to that of autosomes 
more recently, as reflected by the terminal eutherian branches (P > 0.1 
for all tissues and branches), consistent with our hypothesis that gene 
expression evolution proceeded at a higher rate only on the newly 
formed X chromosome. 

The observed pattern is unlikely to reflect new general properties of 
the X chromosome as a sex chromosome (for example its reduced 
effective population size or reduced recombination rates), as such a 
property would lead to accelerated evolution on all branches following 
X-chromosome origination. It may instead reflect an increased rate of 
functional adaptation on the newly formed X chromosome, potentially 
driven by sex-related selective forces that started to shape this chro- 
mosome after sex chromosome differentiation*’, and/or selective pres- 
sures associated with the X-chromosome dosage reduction in males 
resulting from Y-chromosome degradation’ (see also below). In this 
context, it is noteworthy that the rate of protein sequence change*’ 
(except for X-linked genes with Y-chromosome counterparts**) and 
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the rate of fixation of new genes on the X chromosome” seem to have 
increased after differentiation of the sex chromosome. Thus, similarly 
to Drosophila”, early X-chromosome evolution in mammals seems to 
be characterized by increased rates of functional adaptation of genes. 


Modular gene expression change 


Given that genes commonly function together, concerted expression 
changes of distinct sets of genes may often be phenotypically relevant. 
To identify such expression shifts, we identified groups of genes that 
have coherent expression patterns over subsets of samples*? (Sup- 
plementary Note). These ‘modules’ were screened for statistically 
significant enrichments of functional categories. 

Among a total of 639 modules in the all-amniote data and 197 
modules in the primate-specific data set (Supplementary Tables 
4-7; see also the searchable database with comprehensive module 
details at http://www.unil.ch/cbg/ISA/species), there are 33 organ- 
specific modules with conserved expression levels among species 
(17 for amniotes and 16 for primates), 145 modules specific to an 
organ (or organ pair; see below) with distinct lineage-specific expres- 
sion patterns (124 for amniotes and 21 for primates), and 658 
modules that show no clear relation to specific phylogenetic groups 
and/or affect multiple organs (498 for amniotes and 160 for primates) 
(Supplementary Tables 8 and 9). 

The 33 organ-specific conserved modules are enriched with genes 
involved in typical processes (for example synaptic transmission 
for brain; Benjamini-Hochberg corrected P < 0.05), and thus define 
common primate/mammalian organ functions (Supplementary 
Tables 8-10). 

The 145 organ-specific modules with lineage-specific expression 
patterns provide clues to the organ biology of different mammals. For 
example, the all-amniote data reveal 25 nervous tissue modules that 
evolved distinct expression levels along the major terminal branches of 
the mammalian phylogeny (Fig. 3a and Supplementary Tables 8 and 
10). Notably, modules specific to the central nervous system in the non- 
primate mammals often (14 of 16 cases) show altered expression in both 
brain (or cerebral cortex) and cerebellum (Supplementary Table 8), 
suggesting a tight functional and evolutionary link between them in 
mammals. Similarly, modular expression changes in kidney and liver 
often (14 of 28 cases) affected both of these organs, which may reflect 
their close functional interactions regarding detoxification and waste 
excretion. The only terminal lineages with distinct testis modules are 
primates and monotremes (Supplementary Tables 8 and 10). 

Among the 32 modular gene expression changes that occurred along 
the internal branches of the mammalian phylogeny, eight modules in 


brain, cerebellum and/or testis are highly enriched with X-linked genes 
(Benjamini-Hochberg corrected P < 0.05) and became strongly down- 
regulated along the common therian or eutherian branch during sex 
chromosome differentiation (Supplementary Tables 8 and 10), consist- 
ent with observations that reduced gene dosage on the newly evolved 
therian X chromosome in males was not compensated for by global 
transcriptional upregulation of X-linked genes (P. Julien et al., submitted 
manuscript, and ref. 41). 

Modular expression changes between mammals and chicken 
occurred only in the neural tissues and in kidney and liver 
(Supplementary Tables 8 and 10). Four of these modules are signifi- 
cantly enriched with X-linked genes (Benjamini-Hochberg corrected 
P<0.01). Our results suggest that the early evolution of the mam- 
malian brain was strongly associated with X-chromosome expression 
changes, perhaps because of an overrepresentation of proto-X-linked 
brain genes”. 

The only lineage with brain-specific (that is, prefrontal cortex; 
Supplementary Note) expression modules in the primate data set is 
that of humans (Supplementary Tables 9 and 10). The 259 genes in the 
largest of the four human-specific brain modules (ID #p173 in 
Supplementary Table 10) are involved in various neurological pro- 
cesses, several of which (for example cell adhesion molecules; 
Benjamini-Hochberg corrected P< 0.05) were previously found to 
be enriched in analyses of regulatory sequence differences between 
humans and chimpanzees’. Notably, the large number of gene 
ontology categories (12 of 39) related to neuron insulation probably 
reflects the larger proportion of myelinated axons (white matter) in 
the human prefrontal cortex than in that of other primates, implying 
an increased connectivity of this region with other cortical areas. 


Expression shifts of individual genes 

To detect biologically relevant expression changes of individual genes, 
we developed a maximum-likelihood framework for modelling gene 
expression evolution along a phylogeny. We compared several models 
that incorporate selection and genetic drift and take into account 
within-species variations and measurement errors (Supplementary 
Note). To detect relevant lineage-specific changes, we compared a 
model that assumes a single optimum expression level for a given 
gene in all branches of the phylogeny with a model in which this gene 
evolved a different expression optimum in a specified lineage. 

Using this approach, we identified 9,255 significant expression 
changes (Benjamini-Hochberg corrected two-tailed P< 0.05; log- 
likelihood ratio tests; total number of tests, 577,105; Table 2 and 
Supplementary Tables 11-26). Notably, 2,452 (~63%) of 3,909 tested 


a Human Chimp } Bonobo } Gorilla ‘Orang.iMacaque' Mouse Opossumi Platypus | Chicken | b LIX1 COL25A1 TRMTI1L 
1.07; ; ! Brain Brain Cerebellum 
0.811 a eka a aa ail a 
0.6. =~ 

g 
0.4: a 
- 0.2 a 
3 (0.04! 2 
a ' = 
g i : 
a H & 
5 40,! Module 511 € 
BOT Neurological system process I 
8 0.8 : Nervous system development x 
iy 
0.6 H <J Module p173 S 
: Ensheathment of neurons 
0.4 Axonogenesis/axon guidance 
Cell adhesi 
0.2 ieee Bi Chicken i Opossum OD Macaque D Gorilla i Bonobo 
0.0 Gi Brain Gj Cerebellum Other I Platypus i Mouse Orangutan i Chimpanzee HB Human 


Figure 3 | Lineage-specific expression shifts of transcription modules and 
individual genes. a, Modules with specific expression states in human brain 
(prefrontal cortex; 259 genes) and primate cerebellum (189 genes) are shown. 
Bars represent the weighted average expression of all genes in a module, for 
each sample (horizontal grey line indicates average bar height). The horizontal 
red line represents the cut-off of the biclustering algorithm; samples above the 
red line are considered to have a distinct expression state. See Supplementary 
Note and our searchable database (http://www.unil.ch/cbg/ISA/species) for 
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details. b, Examples of genes that evolved new optimal expression levels in 
human prefrontal cortex (LIX1; ENSG00000145721), primate cortex 
(COL25A1; ENSG00000188517) and platypus cerebellum (TRMTIL; 
ENSG00000121486). Expression levels are indicated as log,-transformed 
RPKM (reads per kilobase of exon model per million mapped reads) (see 
Supplementary Tables 11-26 for details). Errors, range of expression values for 
the different individuals for a given species or tissue. 
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Table 2 | Numbers of genes with significant lineage-specific expression switches* 


Gene set Lineage Brain Cerebellum Heart Kidney Liver Testis 

Primate orthologues Human 4 303 13 253 93 628 
Chimp—bonobo 11 16 4 y i 202 
Chimp 0 2 ) 4 0) 25 
Bonobo ) 2 0 8 11 29 
Gorilla 18 73 140 62 37 227 
Orangutan 91 159 v3 95 52 — 
Human-chimp—bonobo ) ci 0) 0 0 4 
African apes ) 46 ) 0 0 554 
Great apes} (macaque) 30 339 18 215 232 556 

All-amniote orthologues Primates 141 5 2 3 3 3 
Mouse 33 81 96 154 90 205 
Opossum 90 195 100 154 113 222 
Platypus 323 358 264 280 207 326 
Eutherians 0) 5 11 5 2 0 
Therians 54 50 31 36 19 59 
Mammals# (chicken) 135 153 106 141 145 207 


* Changes detected using our phylogenetic maximum-likelihood procedure at a false-discovery rate of <0.05. 
+ Expression switch on the branch connecting great apes and macaque. Changes cannot be polarized owing to a lack of outgroup species. 
¢ Expression switch on the branch connecting mammals and chicken. Changes cannot be polarized owing to a lack of outgroup species. 


amniote orthologues and 3,314 (~33%) of 9,969 tested primate ortho- 
logues experienced at least one significant expression shift in one of 
the six organs during amniote or primate evolution, respectively. Our 
method is designed to detect selectively driven expression shifts, but 
might also detect shifts due to genetic drift or other non-adaptive 
forces (for example, biased gene conversion**), gene dosage altera- 
tions (for example, during sex chromosome differentiation; see below 
and P. Julien et al., submitted manuscript) or cellular composition 
changes. In any case, our results provide an extensive list of candidates 
for potentially adaptive expression changes. 

Although the data are not always directly comparable between 
tissues and lineages, owing to differences in statistical power asso- 
ciated with sampling and/or branch length differences, some global 
patterns stand out. In 11 of 15 mammalian lineages, the testis features 
the largest number of significant changes (Table 2), indicating that 
the rapid divergence of the testis (Fig. 2a, b) is at least partly explained 
by specific selective regimes acting on this organ, consistent with 
previous human-chimpanzee expression comparisons” and protein- 
coding sequence analyses**. By contrast, the brain shows few expres- 
sion shifts in mammals, in agreement with the low rate of gene 
expression change in this tissue (Fig. 2a, b), with one notable exception: 
along the primate ancestral branch, by far the largest number of sig- 
nificant shifts (141 of 157) is found for the brain, which may be 
explained by the evolution of more complex cognitive functions, 
alterations of cellular composition and/or sampling differences 
(prefrontal cortex was sampled for primates, and whole brain, except 
cerebellum, was sampled for non-primates; Supplementary Note). The 
evolution of the cerebellum involved larger numbers of significant 
expression shifts than did that of the rest of the brain in 14 of 16 
lineages (Table 2); this might underlie alterations in motor control 
functions among mammals. 

Literature searches provide insights into potential functional impli- 
cations of expression shifts for some of the most significant changes 
(see Supplementary Tables 27-42 for gene ontology analysis results). 
For example, the top candidate for adaptive expression change in the 
cortex on the terminal human branch, LIX1 (Fig. 3b), which is 
strongly upregulated in humans (Benjamini-Hochberg corrected 
P=0.0183), has a crucial role in motor neuron development and 
maintenance”. This gene is also upregulated in the human cerebellum 
(P = 0.0209) and is a component of the human-specific brain expres- 
sion module (ID #p173 in Supplementary Table 10 and database at 
http://www.unil.ch/cbg/ISA/species). Other examples (Supplemen- 
tary Figs 8 and 9) include COL25A1, which has the most significant 
expression shift in the brain of the common primate ancestor 
(reduced expression, corrected P = 0.0032; Fig. 3b) and leads to beha- 
vioural abnormalities when overexpressed in animal models**; and 
TRMTIL, the TRM1-like gene that affects motor coordination and 


exploratory behaviour* and shows higher expression levels in 
platypus brain (corrected P< 107°; Fig. 3b). 

Our analysis of tissue transcriptomes from all major mammalian 
lineages refines previous hypotheses and provides many new clues to 
the function and evolution of mammalian genomes. This work marks 
the beginning of the exploitation of the reported transcriptome data, 
which will facilitate future investigations of mammalian genome biology. 


METHODS SUMMARY 


We extracted high-quality RNA and prepared 131 polyadenylated RNA-seq 
libraries using standard protocols. Libraries were sequenced on Illumina 
Genome Analyser IIx platforms. We refined existing genome annotations to 
resolve potentially confounding factors and establish constitutive and alternative 
exons using a segmentation-clustering approach. We constructed orthologous 
gene sets based on retrieved lists of 1:1 orthologous genes for each species pair. To 
assess the influence of annotation heterogeneities on between-species variation, 
we determined sets of constitutive exon sequences that perfectly align across all 
species. On the basis of our refined annotations, final read mapping positions 
were established using a procedure that resolves read mapping ambiguities. We 
calculated standard expression values (RPKM) that were normalized across 
species and tissues on the basis of rank-conserved genes and a median-scaling 
procedure. Various biological analyses were performed using these data, includ- 
ing the development of a phylogenetic maximum-likelihood approach to detect 
significant expression shifts of individual genes. 

Sequencing data have been deposited in the Gene Expression Omnibus and 
have been made directly available to Ensembl for annotation purposes. The 
different expression level datasets are provided as Supplementary Data Sets 1 
and 2. All intermediate and final results and data are available from the authors 
on request. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Samples and RNA sequencing. The 131 organ samples that provided the founda- 
tion for this study were obtained from various sources (Supplementary Note, 
section 1.1, and Supplementary Tables 1 and 2). To ensure comparability of data 
derived from homologous organs between species, several measures were taken. 
Most of the organs studied represent heterogeneous tissues whose structural and 
cellular composition may vary between species. To account for this issue, major 
parts of each tissue (covering the different structures/cells) were sampled and 
homogenized before RNA extraction, where possible. Given that the brain is a 
particularly heterogeneous tissue, we sampled two parts of the brain (prefrontal 
cortex and brain without cerebellum, depending on the species; cerebellum) for 
each species, one of which (cerebellum) is well defined, structurally similar and 
easily dissectible in all species in spite of the major differences in brain size among 
the amniote species studied (see Supplementary Note, section 1.1, for details). 

Total RNA was extracted using the Trizol (Invitrogen) procedure or RNAeasy/ 
RNAeasy Lipid/miRNeasy (Qiagen) column purification kits as indicated in 
Supplementary Table 1. RNA quality was assessed using an Agilent 2100 
Bioanalyser. Only samples with high RNA integrity values (Supplementary 
Table 1) were used in this study. 

Sequencing libraries were prepared using the mRNA-Seq Sample Prep Kit 
(Illumina) according to the manufacturer’s instructions. Briefly, polyadenylated 
RNA was isolated using a poly-dT bead procedure and then chemically fragmented 
and randomly primed for reverse transcription. After second-strand synthesis, the 
ends of the double-stranded complementary DNA were repaired. After 3’-end 
adenylation of these products, Illumina Paired-End Sequencing adapters were 
ligated to the blunt ends of the cDNA fragments. Ligated products were run on 
gels; 250-300-bp fragments were excised and then PCR-amplified (15 cycles). 
After column purification, qualities of the resulting libraries were assessed using 
Agilent 2100 Bioanalysers. Potential influences on RNA sequencing results due to 
different experimenters preparing the libraries were ruled out on the basis of RNA- 
seq data analysis of replicate libraries prepared by the different experimenters 
(Supplementary Note, section 1.2). The RNA-seq libraries were each sequenced 
(76 cycles) in at least one lane of the Illumina Genome Analyser IIx platform 
according to the manufacturer’s specifications. Technical replicates (that is, 
running the same library on different machines) were performed to rule out 
potential biases during the sequencing step (Supplementary Note, section 1.2). 

After sequencing, we processed the fluorophore intensity files with the IBIS 

base caller, in addition to applying the standard Illumina _base-calling 
algorithms. All subsequent analyses were performed on the IBIS-called reads, 
as the number of correctly mapped reads was significantly increased with this 
base-calling approach (Supplementary Note Fig. 1 and Supplementary Note 
Table 1). 
Initial read mapping and refinement of genomic annotations. We used 
TOPHAT"™ to align the reads to the reference genome sequences and to extract 
splice junctions, without relying on the genome annotations. We developed a 
specific procedure to improve the splice junction detection for genes that have 
recent retrocopies (Supplementary Note, section 1.3). We filtered the alignments 
provided by TOPHAT to extract unambiguously mapping reads, from which we 
built a set of transcribed islands and splice junctions for each RNA-seq sample. 
Only junctions with GT-AG and GC-AG splice sites, for which the sense strand 
can be reliably inferred, were included in this analysis. 

We extracted genomic annotations from Ensembl”, release 57. These annota- 
tions were further extended by recursively adding the coordinates of the tran- 
scribed islands to the gene models, for those islands that were connected by splice 
junctions to previously known exons of the same gene. For this procedure, we 
only considered transcribed islands that were supported by at least two un- 
ambiguously mapping reads, that were in proximity to genes (distance to previ- 
ously known gene boundaries, <100 kb) and that were not connected to multiple 
genes. 

As the annotation extension procedure can result in the inclusion of (possibly 
non-functional) retained introns in the gene models, we further refined the 
annotations by using the splice junction coordinates to define exon boundaries 
precisely (Supplementary Note, section 1.4). 

Constitutive exon definition. Before evaluating gene expression levels, we 
sought to eliminate minor splice isoforms from the gene models, to reduce the 
level of splicing-related noise in our data. As proposed previously”, we used 
information on splice junctions to detect four types of alternative transcription 
event: skipped (or cassette) exons, retained introns and alternative 5’ and 3’ splice 
sites. In addition, we analysed the read coverage variation within genes with a 
segmentation-clustering approach™ to define and remove regions with unusually 
low coverage (Supplementary Note, section 1.5). For each alternative transcrip- 
tion event, we quantified the two possible isoforms on the basis of the splice 
junction frequencies and on the read coverage (Supplementary Note, section 
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1.5). We then defined ‘constitutive’ exon segments as those segments that belong 
to isoforms with frequencies of more than 15%, in all samples. For genes with low 
expression (that is, with an average per-base read coverage of less than three), all 
exons were considered to be constitutive, because the minor isoform identifica- 
tion is not feasible in these cases. 

Orthologous gene sets and exon sequence alignment. We retrieved the list of 1:1 
orthologous genes for each pair of species in our set from the Ensembl database”, 
release 57. From these pairwise orthology relationships, we extracted 5,636 gene 
families that have 1:1 orthology relationships among all the species in our set, as 
well as 13,277 gene families for primates. 

Given the heterogeneity of genomic annotations, we wanted to exclude the 

possibility that gene expression variation between species might be due to the fact 
that gene expression levels are computed on heterogeneous sequences. Thus, we 
built a set of constitutive aligned exons for the 1:1 orthologous families. To do so, 
we aligned the cDNA sequences of the orthologous gene families using TBA**. We 
filtered these alignments to extract perfectly aligned blocks of sequence (no gaps 
were permitted) that corresponded to exon parts considered to be ‘constitutive’ in 
all species. 
Final read mapping. To ensure unambiguous read mapping and optimal sub- 
sequent calculations of expression levels, the final read mapping procedure was 
based on our refined genome annotations (see above) and involved several steps 
(Supplementary Note Fig. 14). To prepare for the mapping of reads, we first built a 
library of splice junction sequences on the basis of the refined exon annotations. 
Asa further preparation step, we then sought to assess the number of theoretically 
possible unique reads per given annotation element (exon, exon part and so on). 
Specifically, we derived all possible read sequences for each annotation (~200 
million reads, depending on the genome) and mapped each of these artificial 
reads onto the respective genome sequence as well as the sequences from the 
splice junction library, using BOWTIE”. We then calculated the unique read 
coverage per genomic element and stored this information for the mapping 
procedure. 

The final mapping positions of RNA-seq reads for a given genome were estab- 
lished as follows. We first mapped each read onto the genome sequence and (in 
parallel) the sequences from the splice junction library, using BOWTIE”. This 
mapping information served as input for an algorithm that was designed to 
resolve ambiguities of reads with multiple mapping positions in the genome 
and calculate basic expression level values for each gene. Specifically, in the case 
of overlapping mappings, the mappings with the lowest numbers of mismatches 
were chosen (in the case of identical numbers of mismatches, spliced reads were 
favoured). Reads that mapped equally well to different genomic loci (for example 
to different duplicate gene copies) were resolved in the following way. We first 
calculated preliminary transcription levels by dividing the number of reads that 
map uniquely to each locus by its unique read coverage (see above). Non-unique 
reads were then distributed among annotated genomic elements according to 
these ratios (that is, loci received reads in proportion to their unique read 
mapping ratios). If two or several loci had identical sequences (that is, they had 
no uniquely mapping reads), reads were distributed evenly among them—if they 
were all multi-exonic. In the case of multi-exonic ‘parental’ genes and their 
identical retroposed gene copies, reads were assigned exclusively to the parental 
genes, given that the majority of retrocopies (in particular recent ones) were likely 
to be non-functional or at least expressed at very low levels”*. 

Expression levels and normalization. On the basis of final read assignments 
described in the previous section, we calculated standard RPKM expression values” 
(that were then log-transformed) for the orthologous genes as defined above. To 
render the data comparable across species and tissues, we then normalized these 
expression values by a scaling procedure. Specifically, among the genes with 
expression values in the interquartile range, we identified the 1,000 genes that have 
the most-conserved ranks among samples and assessed their median expression 
levels in each sample. We then derived scaling factors that adjust these medians to a 
common value. Finally, these factors were used to scale expression values of all genes 
in the samples (Supplementary Note Fig. 16). We note that other normalization 
procedures led to similar results. 

Biological analyses. Various biological analyses were performed using these 
data, which also involved the development of a new method (a phylogenetic 
maximum-likelihood approach to detect selectively driven expression change). 
For a full description of these analyses and newly developed approaches, see 
Supplementary Note. 

The principal-component analysis on gene expression levels was performed 
using the ‘prcomp’ function in the ‘stats’ package in R (http://www.R-project.org/). 

We constructed expression trees using the neighbour-joining approach, based 
on pairwise distance matrices between samples. The distance between samples 
was computed as 1 — p, where p is Spearman’s correlation coefficient, because it 
is insensitive to outliers and any potential inaccuracies in the normalization 
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procedure. Euclidean distances were used as a control (Supplementary Fig. 3). 
The neighbour-joining trees were constructed using functions in the ‘ape’ pack- 


age” 


in R. The reliability of branching patterns was assessed with bootstrap 


analyses (the 5,636 amniote 1:1 orthologous genes and the 13,277 primate 1:1 
orthologous genes were randomly sampled with replacement 1,000 times). The 
bootstrap values are the proportions of replicate trees that share the branching 
pattern of the majority-rule consensus tree shown in the figures. 
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PDGF signalling controls age-dependent 
proliferation in pancreatic B-cells 
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Determining the signalling pathways that direct tissue expansion is a principal goal of regenerative biology. Vigorous 
pancreatic B-cell replication in juvenile mice and humans declines with age, and elucidating the basis for this decay may 
reveal strategies for inducing p-cell expansion, a long-sought goal for diabetes therapy. Here we show that 
platelet-derived growth factor receptor (Pdgfr) signalling controls age-dependent f-cell proliferation in mouse and 
human pancreatic islets. With age, declining B-cell Pdgfr levels were accompanied by reductions in B-cell enhancer of 
zeste homologue 2 (Ezh2) levels and B-cell replication. Conditional inactivation of the Pdgfra gene in B-cells accelerated 
these changes, preventing mouse neonatal B-cell expansion and adult B-cell regeneration. Targeted human PDGFR-a 
activation in mouse f-cells stimulated Erk1/2 phosphorylation, leading to Ezh2-dependent expansion of adult B-cells. 
Adult human islets lack PDGF signalling competence, but exposure of juvenile human islets to PDGF-AA stimulated 
B-cell proliferation. The discovery of a conserved pathway controlling age-dependent B-cell proliferation indicates new 


strategies for B-cell expansion. 


Acquired tissue insufficiency underlies the pathogenesis of diverse 
human diseases; for example, absolute or relative deficits in pancreatic 
B-cells underlie diabetes mellitus. To expand insulin-secreting B-cells, 
investigators have modulated intracellular factors, including f-cell 
cycle regulators, or activated intrinsic regulatory pathways with 
extrinsic mitogenic factors’”. In mice or other experimental animals, 
B-cell expansion from such efforts has been transient or limited, and 
often complicated by uncontrolled f-cell growth, loss of defining 
B-cell features and impaired metabolic control**. Likewise, induction 
of human islet B-cell proliferation remains an unattained goal, with 
studies reporting inconsistent mitogenic responses*®. Thus, we 
sought to identify native signalling pathways governing physiological 
B-cell proliferation and regeneration. 

During physiological growth in humans, mice and other species, 
juvenile B-cells expand by self-renewal’*® while maintaining their 
hallmark functions. Pancreatic B-cell proliferation decreases rapidly 
in juvenile mice and humans, then declines more slowly in adults??°. 
Studies indicate that age-dependent increases of p16’, a cyclin- 
dependent kinase inhibitor encoded by Cdkn2a, restrict proliferation 
of mouse and human f-cells and other tissues with ageing’’’*. We 
showed that the polycomb group protein Ezh2 represses Cdkn2a and 
promotes f-cell proliferation in juvenile mouse islets’. Ezh2 is a 
histone methyltransferase crucial for trimethylation of histone H3 
on Lys27 (H3K27me3), a modification mediating transcriptional 
repression’*’*. Ezh2 expression in mouse and human f-cells declines 
rapidly in juveniles, then slowly in adults’’, similar to the tempo of 
age-dependent decline in B-cell proliferation®’®. Conditional Ezh2 
inactivation in B-cells accelerated loss of H3K27me3 repression at 
Cdkn2a, resulting in premature p16“ expression, impaired B-cell 
proliferation, reduced [B-cell mass and inadequate B-cell regenerative 
recovery after B-cell ablation’’. We proposed that conserved regula- 
tory pathways governing Ezh2 expression in ageing islets could be 
targeted to stimulate adult B-cell proliferation. 


Pdgfr signalling promotes proliferation, survival and migration in 
diverse cell types'’. Pdgf signalling activation stimulates DNA syn- 
thesis in cultured islets'”'*, but the roles of Pdgf signalling in B-cell 
proliferation and expansion were unknown. Pdgf-induced actin 
polymerization in fibroblasts requires Ezh2 (ref. 19), suggesting that 
Pdgf signalling might regulate Ezh2 expression and age-dependent 
B-cell proliferation in islets. Here we demonstrate that a pathway 
regulated by Pdgfr-«, governs the expression and activity of intrinsic 
B-cell factors—including the retinoblastoma protein (Rb), E2f tran- 
scription factors and Ezh2—to control B-cell proliferation. Modulation 
of B-cell Pdgf signalling attenuates age-dependent proliferative failure 
and Ezh2 loss, permitting in vivo expansion of B-cells with regulated 
function. Cardinal features of this pathway are conserved in human 
islets, permitting conditional induction of human f-cell replication. 


Age-dependent f-cell Pdgfr-a loss 


To investigate whether Pdgf signalling might regulate islet B-cell pro- 
liferation and Ezh2 expression, we assessed Pdgf receptor and ligand 
expression in islets and B-cells. Immunohistology revealed that Pdgfr-o 
and Pdgfr-B levels were markedly reduced in islet B-cells at 6 weeks 
and 6 months of age, compared to neonatal islets (Fig. la and Sup- 
plementary Fig. 1a). Real-time polymerase chain reaction with reverse 
transcription (RT-PCR) studies of B-cells purified by fluorescence- 
activated cell sorting (FACS) confirmed that Pdgfra, Pdgfrb and Ezh2 
messenger RNA levels were reduced in adult B-cells at 5 months com- 
pared to 2-week-old B-cells (Fig. 1b). Further RT-PCR analyses of 
isolated islets from wild-type mice ranging from 2 weeks to 13 months 
revealed an age-dependent decline of islet mRNAs encoding Padgfra, 
Pdgfrb and Pdgf ligands (Fig. 1c and Supplementary Fig. 1b-e). In 
contrast, islet mRNAs encoding the insulin receptor, prolactin recep- 
tor, glucocorticoid receptor and intrinsic regulators of B-cell prolifera- 
tion like Pdx1, Foxol, Foxo3, Rb1 (hereafter referred to as Rb), p130 
(also known as RbI2) and E2f1, did not decline in mice with advancing 
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Figure 1 | Age-dependent attenuation of Pdgfr-a limits B-cell Ezh2 
expression and proliferation in neonatal and juvenile mice. a, Pdgfr-o and 
insulin immunostaining on wild-type mouse pancreas sections at indicated 
ages. DAPI, 4’,6-diamidino-2-phenylindole. Scale bar, 25 um. b, c, mRNA 
levels of indicated genes in FACS-purified f-cells (b) or wild-type islets (c) at 
specified ages. n = 3-5 experiments per group or time point. *P < 0.05, 


age (Supplementary Fig. 2). Thus, reduced B-cell levels of Pdgf recep- 
tors and ligands corresponded with Ezh2 loss and age-dependent 
reduction of f-cell replication”’. 

To assess whether premature Pdgfr loss might compromise Ezh2- 
dependent neonatal B-cell proliferation, we generated RIP-Cre; 
Pdgfra" mice (abbreviated to BPdgfraKO) that permit conditional 
Padgfra inactivation in islet h-cells and loss of detectable Pdgfra mRNA 
and protein expression (Fig. ld and Supplementary Fig. 3a). 
Compared to controls, B-cell Ezh2 protein and islet Ezh2 mRNA 
levels were reduced whereas p16'Y“** mRNA levels were increased 
in BPdgfraKO islets (Fig. le and Supplementary Fig. 3a, b); in con- 
trast, Pdgfrb and EzhI mRNA levels were unchanged in BPdgfrakKO 
islets (Supplementary Fig. 3c, d). In 2-3-week-old BPdgfraKO mice, 
we detected a threefold reduction in the B-cell proliferation rate and a 
50% reduction in B-cell mass (Fig. 1f, g). BPdgfraKO mice at 3 weeks 
had mild hyperglycaemia during ad libitum feeding and impaired 
glucose tolerance in a glucose challenge test (Supplementary Fig. 3e, 
f). Thus, premature loss of Pdgfra impairs B-cell proliferation and 
expansion and glucose control in young mice. 


Pdgfr-« controls adult B-cell regeneration 

Destruction of B-cells by streptozotocin (STZ) in mice provokes B-cell 
proliferation with regeneration of B-cell mass'’’*. Following STZ 
exposure, Pdgfra and Pdgfrb were induced in wild-type islets 
(Fig. 2a, b), suggesting possible roles for Pdgfr signalling in adult 
B-cell regeneration. Consistent with this view, B-cell mass in control 
mice gradually recovered after STZ treatment, accompanied by 
increased f-cell Ezh2 expression and proliferation; in contrast, 
BPdgfrakKO f-cells failed to increase Ezh2 or proliferation to restore 
B-cell mass (Fig. 2c-e). BPdgfraKO mice developed severe diabetes 
that persisted throughout the course of our studies (Fig. 2f). In con- 
trast, control mice became moderately diabetic immediately after STZ 
challenge, but achieved restored glucose control 3-4 weeks thereafter 
(Fig. 2f), coinciding with increases of B-cell mass (Fig. 2e). Thus, 
Pdgfr-« is also required in adult B-cells for compensatory prolifera- 
tion in experimental diabetes. 
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**P < 0.01 compared to samples from 2 weeks (b) or 15 days (c). Insr. refers to 
mRNA encoding insulin receptor. d—g, Relative mRNA levels of Pdgfra (d) and 
Ezh2 (e) in isolated islets, and percentage -cell BrdU incorporation (f) and 
pancreatic B-cell mass (g) of BPdgfraKO and control mice at specified ages. 
n = 3-5 independent islet preparations or mice per group. *P < 0.05, 

**P < 0.01 for the comparison as indicated. Error bars denote s.e.m. 


PDGFR-« promotes functional B-cell expansion 


To test whether enhanced Pdgfr signalling might promote B-cell expan- 
sion, we exposed islets from juvenile and adult mice to physiological 
concentrations of platelet-derived growth factor-AA (PDGF-AA). 
Exposure of 3-week-old islets to PDGF-AA increased phospho- 
Pdgfr-a« levels and Ezh2 mRNA and protein levels, but not Ezh1 
expression (Supplementary Fig. 4a-c). In contrast, juvenile islets 
exposed to mitogens like insulin or prolactin did not detectably alter 
Ezh2 mRNA levels (Supplementary Fig. 4d, e). Unlike in 3-week-old 
islets, levels of Ezh2 mRNA and protein were not significantly increased 
in 7- or 9-month-old adult islets exposed to PDGF-AA (Supplementary 
Fig. 4c, f). PDGF-AA exposure increased [-cell proliferation in juvenile 
islets, as assessed by BrdU incorporation, but not in adult islets 
(Supplementary Fig. 4g, h). Increased Ezh2 expression and f-cell 
BrdU incorporation were eliminated by simultaneous treatment with 
the receptor tyrosine kinase inhibitors Sunitinib”® or Vargatef*! (Sup- 
plementary Fig. 4a, c, g, h). Thus, PDGF-AA exposure was sufficient to 
stimulate B-cell replication in juvenile islets, but loss of competence for 
Pdgf prevented this response to PDGF-AA in adult islets. 

To investigate whether sustained PDGFR signalling permitted 
adult B-cell expansion in vivo, we intercrossed the RIP-Cre strain with 
mice harbouring a human PDGFRA”**’Y allele encoding a ligand- 
independent activated receptor inserted at the ROSA26 (R26) locus”. 
A loxP-flanked transcriptional stop sequence permits Cre-dependent 
PDGFRA”**’’ expression and activation of PDGF signalling in vivo”. 
In contrast to littermate controls, mice with the RIP-Cre; R26- 
PDGFRA”**’Y genotype (abbreviated to BPDGFR«Tg) expressed 
human PDGFRA mRNA and PDGFR-«(D842V) protein in islet 
B-cells (Fig. 3a and Supplementary Fig. 5a). Expression of R26- 
PDGFR«”*’Y did not affect endogenous islet Pdgfra mRNA levels 
(Fig. 3a) or B-cell expansion and glucose control in neonatal or juvenile 
BPDGFRaTg mice (Fig. 3b, cand Supplementary Figs 5b and 6a-d). By 
3 months, B-cell Ezh2 expression had declined in control mice, but 
B-cell Ezh2 expression in BPDGFRaTg mice was maintained at ele- 
vated levels, a difference that persisted for 8-14 months (Fig. 3e, f). 
mRNA levels of islet p16"““** and p19*"F, both targets"? of Ezh2, were 


©2011 Macmillan Publishers Limited. All rights reserved 


a Pdgfr-o 


i 


ARTICLE 


BPdgfrakO 


. 10 : | | Control . L a eles 
2 Bvenicie 10 & 600 * 
s ee BPdgfrakKO co) D 
2 8) ,, USTZ S ay — BPPos E = = 4 
aT 2 2 @ 400 
z ° 3B 6 g g 
E 4 - £ 8 
2 pulls 3 . g 2 200 @ Control 
Bg? o 2 oe 8 © pPdgfrakO 
co 0 co 0 
ge xe No Nd 3 21 58 0 10 20 30 40 50 60 
Oe) ee) ioe & 
g g f ie Days after STZ Days after STZ 
Xo 


Figure 2 | Pdgfra loss impairs f-cell regeneration in STZ-induced diabetes. 
a, b, Immunostaining of Pdgfr-« and insulin on pancreas sections (a), and 
relative mRNA levels of Pdgfra and Pdgfrb in islets (b) from 8-week-old wild- 
type mice 1 week after vehicle or STZ (100 mgkg * body weight) treatment. 
n = 3-5 mice per group. a, Yellow arrows mark Pdgfr-o induction in a subset of 
insulin* cells. c, Representative images showing immunostaining for insulin, 
Ezh2 and BrdU on pancreatic sections from indicated 6-7-week-old mice 3 


reduced in BPDGFRaTg mice compared to those in littermate controls 
(Supplementary Fig. 6f, g). In contrast, islet mRNA levels of Ezh1 in 
BPDGFRaTg and control mice were indistinguishable (Supplemen- 
tary Fig. 6h). B-Cell proliferation in 14-month-old BPDGFRaTg mice 
was increased ninefold compared to age-matched controls, a level of 
proliferation seen in 3-month-old control B-cells (Fig. 3d, f). B-Cell 


weeks after STZ (150 mg kg * body weight) treatment. d, Percentage of BrdU~ 
insulin* cells (B-cells) or BrdU* acinar cells 3 weeks after STZ challenge. n = 3 
or 6 mice per group. e, f, Pancreatic (}-cell mass (e) and blood glucose (f) levels 
during ad libitum feeding after STZ treatment. n = 3-8 mice (e) or 18-21 mice 
(f) per time point per group. *P < 0.05, **P< 0.01 for the comparison as 
indicated or versus control. Scale bars, 25 tm. Error bars denote s.e.m. 


mass was increased in BPDGFRaTg mice at 3 months, and remained 
increased at 14 months compared to age-matched controls (Fig. 3b). In 
contrast, total pancreatic mass and islet architecture in BPDGFRaTg 
mice and controls were indistinguishable (Supplementary Figs 5c and 
6b). Thus, Pdgf signalling activation is sufficient to sustain adult B-cell 
expansion in vivo. 
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Figure 3 | Activated PDGFR-«. delays age-dependent Ezh2 loss and 
replication failure in pancreatic p-cells. a, Relative mRNA levels of human 
PDGFRA and mouse Pdgfra in islets from littermate control and BPDGFRaTg 
mice at 3 months of age. n = 3-7 mice per group. b-d, Pancreatic B-cell mass 
(b), and B-cell proliferation, assessed by Ki67 expression (c) or BrdU 
incorporation (d) in BPDGFR«KO and control mice at indicated ages. n = 3-5 
mice per group. e, f, Relative mRNA levels of Ezh2 (e) in islets, and 


immunostaining (f) for Ezh2, BrdU and insulin on pancreatic sections from 
control and BPDGFR«Tg mice at indicated ages. n = 3-7 mice per group. Scale 
bars, 25 tum. g, Postprandial blood glucose levels in mice at indicated ages. 

n = 14-24 mice per group per time point. h, Glucose tolerance assessed in 14- 
month-old control and BPDGFRaTg mice. n = 6 or 10 mice per group. 

*P < 0.05, **P < 0.01 in comparisons indicated. Error bars denote s.e.m. 
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Prior studies indicate that f-cell expansion stimulated by mitogens 
can be complicated by disrupted B-cell function and impaired meta- 
bolic control**. To assess the physiological impact of prolonged adult 
B-cell proliferation in BPDGFRaTg mice, we measured systemic 
blood glucose and insulin levels. Blood glucose levels during fasting 
and ad libitum feeding and responses to insulin challenge were indis- 
tinguishable in BPDGFRaTg mice and littermate controls (Sup- 
plementary Fig. 6c-e). During re-feeding after overnight fast, 
BPDGFRaTg mice had a 25% reduction of blood glucose (Fig. 3g), 
associated with a modest elevation of plasma insulin levels (Sup- 
plementary Fig. 6i), and both changes were sustained for up to 14 
months. After overnight fast, glucose clearance was enhanced in 
glucose-challenged 3- or 14-month-old BPDGFRaTg mice (Fig. 3h 
and Supplementary Fig. 6j). Thus, PDGFRaTg mice maintained 
normal glucose-regulated insulin release and glycaemic control, sug- 
gesting that B-cell function remained regulated in these mice. 


PDGF-induced f-cell expansion requires Ezh2 


To test whether PDGFR-«-induced B-cell proliferation and expan- 
sion required Ezh2, we generated mice permitting conditional 
inactivation’ of Ezh2 in BPDGFRaTg B-cells. Intercrosses generated 
RIP-Cre; Ezh2"!, R26-PDGFRa?*’’ mice (abbreviated to BEzh2KO- 
RaTg) and littermate or sibling RIP-Cre; Ezh2"*. R26-PDGFRo???Y 
mice (abbreviated to BEzh2HET-RaTg) and RIP-Cre; Ezh2 mice 
(abbreviated to BEzh2KO). Islet PDGFRA?®Y mRNA levels in both 
BEzh2KO-RaTg and BEzh2HET-RaTg mice were elevated and indis- 
tinguishable (Fig. 4a). By contrast, increased islet Ezh2 mRNA levels 
and f-cell Ezh2 protein were not observed in BEzh2KO-RaTg or 
BEzh2KO littermates (Fig. 4b and Supplementary Fig. 7), confirming 
Cre-mediated deletion of Ezh2 in these mice. As expected, B-cell BrdU 
incorporation and mass were increased in BEzh2HET-RaTg mice 
(Fig. 4c, d and Supplementary Fig. 7) compared to controls (either 
Ezh2", R26-PDGFRa?*Y or Ezh2"*, R26-PDGFRa*?" genotype). 
However, neither B-cell BrdU incorporation nor mass were increased 
in BEzh2KO-RaTg mice compared to controls (Fig. 4c, d and Sup- 
plementary Fig. 7). Consistent with these findings, we detected modest 
postprandial blood glucose reduction and increased blood insulin 
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Figure 4 | PDGFR-a promotes f-cell expansion through Ezh2. a,b, mRNA 
levels for human PDGFRA (a) and mouse Ezh2 (b) in islets from mice with 
indicated genotypes at 3-4 months of age. n = 3-5 mice per group. c-f, B-Cell 
BrdU incorporation (c), B-cell mass (d), postprandial plasma insulin (e) and 
blood glucose (f) levels in 3-4-month-old mice with indicated genotypes. Each 
dot in f represents a measurement from an individual mouse. n = 4-5 mice for 
cand d, and n = 5-18 mice for e and f per genotype. *P < 0.05, **P < 0.01. 
NS, not significant in comparisons indicated. Error bars denote s.e.m. 
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levels in BEzh2HET-RaTg mice, but not in BEzh2KO-RaTg mice 
(Fig. 4e, f). Thus, Ezh2 is required for B-cell expansion and metabolic 
changes from PDGFR-« activation in B-cells. 


B-Cell PDGFR controls Ezh2 via Erk and Rb/E2f 


To elucidate the signalling basis of altered B-cell proliferation in 
BPDGFRaTg, BPdgfraxKO and ageing wild-type mice, we investigated 
Pdgf signal transduction factors. Pdgf receptors activate signalling ele- 
ments’*, including the mitogen-activated protein kinase/extracellular 
signal-regulated kinase (Mapk/Erk), phosphatidylinositol 3-kinase 
(PI3K)/Akt and phospholipase PLC-y. Ezh2 induction in juvenile 
wild-type islets exposed to PDGF-AA was blocked by U0126, which 
inhibits Erk1/2 activation”’, but not by LY294002 or U-73122, inhibi- 
tors of PI3K or PLC-y signalling (Supplementary Fig. 8a). Similarly, 
increased B-cell BrdU incorporation in islets exposed to PDGF-AA 
was blocked by U0126, but not by LY294002 (Supplementary Fig. 8b), 
indicating that Erk mediates B-cell Pdgf signalling responses. We also 
detected increased phosphorylation of Erk1/2, but not phosphoryla- 
tion of Akt or PLC-y, in BPDGFRaT¢ islets compared to control islets 
(Fig. 5a, b). Immunohistology confirmed increased phospho-Erk1/2 
levels in B-cells of BPDGFRaTg islets compared to controls (Fig. 5c). 
PDGF/ERK signalling activates cell cycle regulators to stimulate pro- 
liferation**”*, and BPDGFRaTg islets had elevated levels of Ccnd1 
mRNA encoding cyclin D1, and increased B-cells with nuclear cyclin 
D1 and phosphorylated Rb protein (Fig. 5c-e and Supplementary 
Fig. 9a). In contrast, loss of B-cell Pdgf signalling in BPdgfraKO mice 
diminished islet levels of Ccnd1 mRNA, and reduced numbers of 
phospho-Erk1/2* and phospho-Rb* B-cells (Supplementary Fig. 
9b, c). Rb protein phosphorylation regulates association of the E2f1 
transcriptional activator and E2f4 transcription repressor with their 
targets**’’. We identified two conserved candidate E2f-binding ele- 
ments near the Ezh2 promoter (Supplementary Fig. 10), and chro- 
matin immunoprecipitation (ChIP) studies showed increased E2f1 
association and reduced E2f4 association with these Ezh2 elements 
in BPDGFRaT¢g islets (Fig. 5f, g). In wild-type mice, levels of 
phospho-Erk1/2 and phospho-Rb declined with age in islet B-cells at 
a tempo similar to declining Ezh2 and Pdgfra expression (Supplemen- 
tary Fig. 11a, b). ChIP studies also revealed that association of E2f1 with 
Ezh2 decreased, whereas association of E2f4 with Ezh2 elements 
increased in islets with age (Supplementary Fig. 11c, d). Thus, age- 
dependent attenuation of B-cell Pdgfr-o altered Erk and Rb/E2f sig- 
nalling, resulting in reduced Ezh2 expression. 

To verify that Rb/E2f signalling regulates B-cell Ezh2, we examined 
triple mutant ROSA-CreER™”; Rv. p130" f pl07" '~ mice (RbTriKO) 
lacking Rb family proteins”. After exposure to tamoxifen, B-cells in 
adult RbTriKO mice, lacking detectable Rb, p130 and p107, had 
increased Ezh2 protein and mRNA levels, compared to those from 
tamoxifen-exposed Ro‘ p130"", p107" '~ control littermates (Sup- 
plementary Fig. 12a-c). Islet ChIP studies revealed increased E2fl 
association, and reduced E2f4 association with Ezh2 cis-regulatory 
elements (Supplementary Fig. 12d, e). Consistent with these findings, 
BrdU incorporation in RbTriKO B-cells was increased by 15-fold, 
culminating in a 400% increase of B-cell mass and hypoglycaemia 
that progressively worsened until death (Supplementary Fig. 12f-h). 
Thus, Rb proteins constrain Ezh2 expression and adult islet cell 
growth. 


PDGF control of human f-cell proliferation 

To investigate the relevance of our studies to human f-cells, we 
assessed PDGF signalling and competence in human islets. Like in 
juvenile mice, PDGFR-« protein and phospho-ERK1/2 levels were 
readily detected in islets from young human donors (Fig. 6a, b). We 
also detected abundant phospho-RB-Ser 780 and EZH2 protein in 
B-cell nuclei from young islets (Fig. 6b). In contrast, PDGFR-« was 
only detected in non-f-cells of islets from adult human donors, with 
little to no detectable phospho-ERK1/2, phospho-RB or EZH2 in 
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Figure 5 | Pdgfr signalling governs Erk and Rb/E2f regulation of Ezh2 in 
islet B-cells. a, Western blot analysis of total and phosphorylated Erk1/2, Akt 
and PLCy in islet proteins from 3-4-month-old BPDGFR«T¢g and littermate 
control mice. Similar results were obtained from multiple independent 
experiments. b, Relative phosphorylated protein level compared to total protein 
was quantified by densitometry. c, Immunostaining for phospho-Erk1/2 
(pErk1/2), phospho-Rb-Ser 780 (p-Rb(Ser780)) and insulin in 3-month-old 
littermate control and BPDGFR«Tg pancreatic sections. Scale bar, 25 jum. 

d, Percentage of phospho-Rb-Ser 780° insulin® cells from the indicated mice 


B-cells (Fig. 6a, b), suggesting that PDGF signalling attenuation is a 
conserved feature of ageing islet B-cells. mRNA levels of EZH2, but 
not EZH1, were increased in juvenile human islets exposed to PDGF- 
AA, an effect blocked by Sunitinib or U0126 (Fig. 6c and Sup- 
plementary Fig. 13a). In contrast, basal EZH2 mRNA levels were 
lower in adult islets and not induced by PDGF-AA (Fig. 6c). 
Similar to juvenile mouse islets with activated Pdgf signalling, ChIP 
studies demonstrated PDGF-AA-stimulated association of E2F1 with 
the human EZH2 locus, an effect blocked by Sunitinib (Fig. 6d). Thus, 
evolutionarily conserved elements of the PDGF signalling pathway 
govern EZH2 expression and f-cell cycle regulators in human islets. 
Proliferating BrdU™ cells in juvenile islets stimulated by PDGF-AA 
were predominantly f-cells, verified by immuno-localization of 
BrdU~ nuclei within insulin® cells and PDX1* nuclei with confocal 
microscopy (Fig. 6f and Supplementary Fig. 13b, c). Compared to 
vehicle-exposed controls, juvenile islets exposed to PDGF-AA had a 
sixfold increase of B-cell BrdU incorporation, an effect eliminated by 
simultaneous exposure to Sunitinib or U0126 (Fig. 6e, f). In contrast, 
PDGF-AA did not alter B-cell proliferation in adult human islets 
(Fig. 6e, f). Thus, dynamic PDGF signalling competence may regulate 
declining B-cell proliferation in ageing human islets. 


Discussion 


The elucidation and control of mechanisms governing pancreatic 
B-cell proliferation could transform treatments for diseases like 
diabetes. However, prior attempts to expand islets by modulating 
extrinsic or intrinsic growth regulators have been bedevilled by limited 
proliferation with loss of defining B-cell features**, indicating disrup- 
tion of mechanisms that preserve fate in proliferating B-cells. The 
maintenance of cell fate and function in proliferating cells is the essence 
of epigenetic regulation. Here we focused on identifying the principal 
elements of a native signalling pathway regulating Ezh2, an essential 
epigenetic regulator of f-cell proliferation’’. We found that B-cell 
Pdgfr signalling was required for Ezh2 induction, to sustain both 
physiological and regenerative B-cell proliferation. 

During islet B-cell regeneration (this study), and in maternal islets 
in pregnancy (H.C. and S.K.K., unpublished results), we observed a 
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quantified by morphometry. n = 4 mice per group. e, Cyclin D1 (Ccnd1) or 
cyclin D2 (Ccnd2) mRNA levels in 3-month-old control and BPDGFRaTg 
islets. n = 4 or 6 mice per group. f, g, ChIP analyses of the Ezh2 and f-actin 
(Actb) loci with antibodies to E2f1 (f) and E2f4 (g) using the indicated 
amplicons (see Supplementary Information) in 3-4-month-old littermate 
control and BPDGFRaT¢ islets. n = 3-4 independent experiments per 
antibody with independent islet samples. *P < 0.05, **P < 0.01 for control 
versus BPDGFRa«Tg. Error bars denote s.e.m. 


twofold increase of islet Ezh2 mRNA levels. However, pathological 
loss of all Rb family proteins in RbTriKO islets led to a fourfold 
increase in Ezh2 mRNA, and a 15-fold increase in B-cell proliferation. 
RbTriKO mice became hypoglycaemic and developed other islet and 
metabolic phenotypes reminiscent of insulinoma pathogenesis. Thus, 
the physiological expansion of B-cells maintaining regulated function 
may require mechanisms limiting Ezh2 induction. Compared with p- 
cells from BEzh2KO mice, we also observed a modest enhancement of 
proliferation and expansion after PDGFR-« activation in B-cells lack- 
ing Ezh2. Insulin or prolactin, two mitogens capable of promoting 
B-cell proliferation'’*”’, did not induce P-cell Ezh2 expression in 
cultured islets. Thus, our findings here support the view that Ezh2- 
independent mechanisms regulate B-cell proliferation. Likewise, 
decline of B-cell proliferation in ageing BPDGFRaTg mice suggests 
that PDGF-independent pathways restrict B-cell expansion. 

Proliferative responses to PDGF decline in some ageing cells*®, and 
progenitor cell self-renewal is impaired by Pdgf signalling disrup- 
tion*!. However, the molecular basis for these observations and the 
roles of Pdgf signalling in islet B-cell growth and development were 
not established’”"*“*. Here we showed that loss of B-cell Pdgfr-o 
signalling accelerated [B-cell replication failure both in juvenile mice 
and in adult mice after conditional B-cell ablation. In vivo activation of 
B-cell PDGFR-« signalling increased Ezh2 expression and mitigated 
age-dependent decline of B-cell proliferation. Crucially, conditional 
Ezh2 inactivation in B-cells with activated PDGFR-« signalling pre- 
vented these changes. Thus, our conditional loss- and gain-of- 
function studies reveal how Pdgf signalling regulates age-dependent 
physiological B-cell proliferation and expansion. Our work also sug- 
gests that Mapk/Erk regulation links Pdgfr activation to Ezh2 induc- 
tion and B-cell proliferation, consistent with studies demonstrating 
Erk regulation of Ezh2 in cancer cells**, and Erk activation that 
accompanies mouse -cell expansion in pregnancy™. Further studies 
should clarify how Erk and other factors might integrate B-cell res- 
ponses to Pdgf or other mitogens”. 

Although the mechanisms underlying pancreatic [B-cell expansion 
in physiological settings may differ between species**”’, age-dependent 
restriction of B-cell proliferation is a conserved feature in mice and 
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Figure 6 | PDGFR-« regulates human £-cell EZH2 expression and 
proliferation. a, b, Representative images showing immunostaining for 
PDGFR-« (a), phospho-ERK1/2 (pERK1/2), phospho-RB-Ser 780 
(pRB(Ser780)), EZH2 (b) and insulin (a and b) on pancreatic sections from 
juvenile and adult human subjects. PDGFR-« was detected in juvenile B-cells 
(arrows) but not in adult B-cells (arrowheads). Scale bars, 25 tum. 

c-f, Assessment of effects on human juvenile or adult islets after exposure to 
PDGF-AA (50 ng ml ') for2 days, with or without Sunitinib (2 1M) or U0126 
(10 tM) co-treatment. c, Islet EZH2 mRNA levels after the indicated treatments. 


humans*”’. Here, we provide unprecedented evidence that a conserved 


signalling pathway restricts B-cell proliferation in mice and humans. 
We suggest that B-cell competence for signalling factors like PDGF can 
underlie dynamic islet responses to mitogens. Such age-dependent 
signalling competence might explain earlier findings of inconsistent 
human f-cell responses to other mitogens*®. PDGF ligands are mea- 
surable in serum, and regulation of such circulating signalling agonists 
may provide an additional level of growth control in competent islet 
B-cells. Prior studies correlated increased PDGFR and EZH2 expres- 
sion in human endocrine neoplasias**, and disrupted PDGF signalling 
in B-cells of humans with type 2 diabetes®”. Thus, our demonstration 
that PDGF signalling regulates EZH2, CDKN2A and P-cell proliferation 
in human islets combines several observations into a new molecular 
model for physiological and pathological B-cell growth. The factors 
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n = 3-5 independent experiments. d, EZH2 locus ChIP analysis with anti-E2F1 
antibody or IgG in human juvenile islets. n = 3-4 for E2F1, n = 2 for IgG. 

e, f, Human islet B-cell proliferation changes after the indicated treatments. 
Average percentage of BrdU" insulin” cells (e) was quantified by morphometry 
from sectioned islets immunostained (f) for insulin (green), glucagon (white) 
and BrdU (red). n = 3-6 independent experiments. Scale bar, 25 jum. 

g, Illustration summarizing how Pdgfr-o signalling regulates B-cell Ezh2 and 
proliferation by activating Erk/Rb/E2f pathways sensitive to the indicated 
inhibitors. *P < 0.05, **P < 0.01 as indicated. Error bars denote s.e.m. 


governing f-cell expression of PDGF signalling regulators like 
PDGFR-« remain unknown, but we speculate that the identification 
of such factors may be useful for modulating human B-cell growth and 
function in diabetes and cancer. 


METHODS SUMMARY 


Animal studies and mouse islet culture experiments. Details for various mouse 
strain generation, in vivo manipulation and measurement, mouse islet isolation and 
culture and FACS-based f-cell purification are provided in Supplementary Data. 
Histology, immunohistochemistry and molecular studies. Standard histology 
and immunostaining protocols for various proteins on pancreas or islet sections 
were described previously'’. B-Cell mass and proliferation were morphometrically 
quantified’’ by independent researchers blinded to sample identity. Details of these 
studies and real-time RT-PCR, western blot and ChIP analyses are provided in 
Supplementary Data. 
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Human pancreas and islet studies. Four juvenile and four adult pancreases, and 
eight juvenile and three adult individual islet preparations from healthy human 
deceased organ donors (mean age 3.0+0.6 years for juveniles; mean age 
44.3 + 5.3 years for adults) were used in this study. As in the mouse studies, 
similar procedures and experiments were performed on human pancreatic sections 
or islet cultures as described in Supplementary Data. 

Statistical analyses. Results are shown as mean + standard error of the mean 
(s.e.m.). Statistical analysis was performed by one-way ANOVA, two-way 
repeated measures ANOVA, or unpaired Student’s t-test, as appropriate, with 
significance set at P< 0.05. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Animals. Transgenic mice harbouring a floxed Pdgfra allele*® encoding Pdgfr-o 
were purchased from the Jackson Laboratory. This strain of mice was intercrossed 
with RIP-Cre mice’! expressing Cre recombinase from rat insulin 2 gene pro- 
moter elements to generate RIP-Cre, Pdgfra! mice (designated as BPdgfrxKO) 
and their littermate controls (including RIP-Cre, Pdgfrel”* mice designated as 
BPdgfroxHet) on a mixed 129/Sv; C57BL/6 genetic background. 

R26°PPGFRPM transgenic mice that harbour a mutated human PDGFRA?*?Y 
allele encoding a ligand-independent activated receptor targeted immediately 
after a JoxP-flanked transcriptional stop sequence at the ROSA26 locus have been 
described previously”. This strain of mice was intercrossed with RIP-Cre mice to 
generate RIP-Cre; R26-PDGFRA”*”Y mice (designated as BPDGFR«Tg) 
together with their littermate controls on a mixed 129/Sv; C57BL/6 genetic back- 
ground. Mice with specific expression of mutant human PDGFRA?*?Y, but 
lacking Ezh2 in B-cells (designated as BEzh2KO-RaTg), were generated from 
subsequent intercrosses between BEzh2KO”* and BPDGFRaTg mice. 

Tamoxifen-inducible, compound triple-mutant mice lacking retinoblastoma 
pocket proteins Rb, p130 and p107 (ROSA26-CreER™; Rb”; p130", p107”-, 
designated as RbTriKO) were produced in the Sage group (Stanford 
University). RbTrikKO mice harbour a germline-deleted p107 null allele 
(p1077-), and loxP-flanked Rb and p130 conditional alleles which can be excised 
by tamoxifen-sensitive CreER'? recombinase targeted into the ROSA26 locus”. 
Littermate mice (Rb; p130 p107-) lacking ROSA26-CreER™ were used as 
controls. To delete Rb and p130 alleles, RoTriKO and littermate control mice aged 
3-5 months received intraperitoneal injections of tamoxifen (Sigma, 1.5 mg per 
mouse per day) dissolved in ethanol/corn oil (Sigma) on five consecutive days. 

MIP-EGFP mice” were obtained from the laboratory of M. Hara. C57BL/6 
mice were purchased from the National Cancer Institute (NCI). All mutant mice 
used in this study were genotyped by PCR of tail genomic DNA for the human 
PDGFRAY®”’Y allele, mouse Pdgfra’, Ezh2/, Rv, p130/, p107 alleles, and trans- 
genes encoding EGEP, RIP-Cre or ROSA-CreER"” as described previously'*??8°, 
Mice used in this study were age- and gender-matched littermates including both 
sexes. All mice were housed in the animal facility of Stanford University on a 12-h 
light/dark cycle with ad libitum access to water and normal chow except when 
otherwise indicated. All animal experiments and methods were approved by the 
Institutional Animal Care and Use Committee of Stanford University. 
Physiological studies. We performed glucose physiology studies as previously 
reported’’. Briefly, at the ages or times indicated in the text and figures, or after 
tamoxifen administration, we monitored body weight and blood glucose levels in 
ad libitum fed mice, or after 16h overnight fasting, or 2h after re-feeding (‘2h 
postprandial’). We drew blood from tail vein punctures or from the aorta when 
mice were killed, for measuring blood glucose levels using a Contour glucose 
meter (Bayer), or for measuring plasma insulin levels by mouse insulin ELISA 
kits (ALPCO Diagnostics). 

Glucose tolerance tests were performed on mice after a 16 h overnight fast, and 
the blood glucose levels were determined immediately before (0) and 5, 10, 20, 30, 
45, 60, 90 and 120 min after intraperitoneal injection of glucose (2 gkg | body 
weight). For measurement of plasma insulin levels during these glucose tolerance 
tests, mice were injected with glucose at a dose of 3gkg ‘ body weight, and we 
collected tail vein blood at 0, 15 and 45 min after glucose injection followed by 
mouse insulin ELISA (ALPCO diagnostics). For insulin tolerance tests, mice were 
fasted for 6h then received intraperitoneal injection of 0.75 unitskg | of body 
weight of insulin (Sigma); glucose levels at 0, 10, 20, 30, 40, 60 and 90 min after 
injection were then measured. 

For detecting in vivo B-cell proliferation by BrdU incorporation, mice were either 
fed with water containing BrdU (1 mg ml’) or intraperitoneally injected with 
BrdU/PBS solution (50mgkg | body weight) for specified schedules as follows: 
BPdgfraxKO mice, 2 days of intraperitoneal injection of BrdU/PBS; BPdgfrakKO 
mice with STZ treatment, 1 week of BrdU water feeding; BPDGFRaTg or 
BEzh2KO-RaTg mouse studies, 6-7 days of BrdU water feeding; RbTriKO mice, 
4 days of BrdU water feeding. After BrdU chase, mouse pancreases were collected 
and subjected to immunohistology followed by morphometric analyses described 
later. 

STZ-induced diabetes. For measuring islet Pdgfra and Pdgfrb induction during 
STZ-induced diabetes, male wild-type C57BL/6 mice, aged 8 weeks, were injected 
intraperitoneally with 100 mg kg! body weight of freshly dissolved streptozotocin 
(STZ, Sigma) in 0.1 moll! sodium citrate (pH 4.5) or with the same volume of 
sodium citrate (Vehicle). Six days later, islets or pancreases were recovered from 
the mice for gene expression analyses by real-time RT-PCR or immunohistology 
studies. To test the roles of Pdgfra in islet regeneration in STZ diabetes, BPdgfrakKO 
or control mice (including BPdgfraHet), aged 7-8 weeks, were treated with a single 
intraperitoneal STZ injection at a dose of 150 mg kg” ' body weight. After injection, 
blood glucose values were measured on day 3 and day 6 in the first week, and 


thereafter once per week. After 2 weeks of STZ treatment, subsets of BPdgfraKO 
and control littermates were fed with BrdU-containing drinking water (1 mg 
ml ') for 1 week for BrdU incorporation analyses. Three days, 21 days and 58 
days after STZ treatment, subsets of mice were killed, and blood samples and 
pancreases were harvested for measurement of plasma insulin levels by ELISA, 
and for pancreatic B-cell proliferation and mass quantification following immuno- 
staining of insulin and BrdU. 
Islet isolation and culture studies. Mouse pancreatic islets were purified as 
described previously’* and used immediately for assays described in the text. 
To study Ezh2 induction in cultured islets exposed to mitogens, freshly isolated 
islets from 2-3-week old, and 7-12-month old C57BL/6 mice were recovered and 
equilibrated overnight at 37 °C, 5% CO, in RPMI-1640 medium supplemented 
with 0.5% fetal bovine serum (FBS), 0.2% bovine albumin (BSA) and 2% peni- 
cillin/streptomycin. Afterwards, islets were distributed into microplates or dishes, 
and treated with recombinant human PDGF-AA, prolactin or insulin (Sigma), 
with or without addition of pharmacological inhibitors, including Sunitinib 
(Selleck), Vargatef (Selleck), U0126 (Sigma), LY294002 (Sigma) or U-73122 
(Sigma) at the indicated concentrations. Two days later, islets were harvested 
for RNA or protein extraction for real-time RT-PCR or western blot analyses. 
To detect phosphorylation of Pdgfr-« by western blot, islets were exposed to 
PDGF-AA (with or without indicated pharmacological inhibitors) for 90 min 
before islet harvesting and protein extraction. To detect in vitro islet B-cell pro- 
liferation stimulated by PDGF-AA, a final concentration of 50 uM BrdU was 
introduced into each culture dish 24h before harvest, followed by standard 
histological processing. Immunostaining of BrdU and insulin was performed 
on processed islets, and B-cell proliferation rate was determined by quantifying 
percentage of insulin® cells with BrdU. Culture experiments of human islets 
were performed using a similar procedure described later. 
B-Cell purification by FACS. Freshly purified islets from MIP-GFP mice at 2 
weeks or 5 months of age were pooled, and dispersed into a suspension of single 
cells by incubation with 0.05% trypsin/0.53mM EDTA solution at 37 °C for 
10 min as described previously*’**. After dissociation, cells were washed three 
times in PBS containing 2% FBS, and re-suspended in FACS buffer (HBSS/0.2% 
BSA/1% HEPES) containing propidium iodide (to exclude dead cells). FACS was 
performed on a FACS Ariall (BD Biosciences), and MIP-GEP* -cells with 
>95% purity were harvested and immediately lysed for RNA purification and 
real-time RT-PCR analysis. 
Real-time RT-PCR and western blot analysis. Total RNA from freshly isolated 
or cultured islets after indicated treatments were purified using the Absolutely 
RNA miniprep purification kit (Stratagene) according to the manufacturer’s 
instructions. RNA concentration was measured with a RiboGreen RNA quan- 
tification assay (Invitrogen). One-step quantitative RT-PCR was performed and 
analysed using an ABI Prism 7300 detection system (Applied Biosystems) with 
TaqMan one-step RT-PCR Master Mix Reagents and appropriate amounts (10- 
100 ng) of islet total RNA as the template. We calculated the ratio of mRNA for the 
gene of interest to the amount of internal control mRNA of peptidylprolyl iso- 
merase A (cyclophilin A, PPIA), and then normalized the ratio for each gene to its 
median. Primer and probe sequences are listed in Supplementary Tables 1 and 2. 
For western blots, total islet protein was prepared from freshly isolated or cul- 
tured islets after indicated treatments as previously described’. Equal amounts of 
protein were resolved on SDS-PAGE and transferred to polyvinylidine fluoride 
membranes (Amersham Pharmacia) for immunoblotting with specific antibodies, 
including rabbit polyclonal anti-human Ezh2 (1:1,000, Millipore), rabbit polyclonal 
anti-phospho-PDGFR-« (Tyr 720) (1:1,000, Santa Cruz Biotechnologies), rabbit 
monoclonal anti-PDGFR-« (1:1,000, Cell Signaling), mouse monoclonal anti-B- 
actin (1:4,000, Sigma), rabbit monoclonal anti-Akt (1:1,500, Cell Signaling), rabbit 
monoclonal anti-phospho-Ark (Ser 473) (1:1,000, Cell Signaling), rabbit mono- 
clonal anti-p44/42 MAPK (Erk1/2) (1:1,500, Cell Signaling), rabbit monoclonal 
anti-phospho-p44/42 MAPK (Erk1/2) (Thr 202/Tyr 204) (1:1,000, Cell Signaling), 
rabbit monoclonal anti-PLCy1 (1:1,000, Cell Signaling) and rabbit monoclonal 
anti-phospho-PLCy1 (Tyr 783) (1:1,000, Cell Signaling). Signal was visualized 
using ECL detection (Amersham Pharmacia) on Kodak film after further incuba- 
tion with HRP-conjugated secondary antibodies. For phospho-PDGFR-o 
(Tyr 720) and PDGFR-« detection, signal was amplified by incubating the mem- 
brane with biotinylated anti-rabbit-IgG (1:2,000), followed with an incubation 
with HRP-avidin (1:2,000, Vector Laboratories) and then ECL visualization. 
Histology, immunofluorescence and immunohistochemistry. We performed 
standard histological paraformaldehyde fixation, paraffin-embedding and 
immunostaining protocol as previously described”’. Briefly, immunohistochemical 
analysis was performed on 5-|1m-thick sections of pancreatic tissues or islet sections 
after antigen retrieval (DAKO) and using the following primary antibodies: 
guinea pig anti-insulin (1:400, Sigma), mouse anti-glucagon (1:200, Sigma), rabbit 
anti-somatostatin (1:200, Dakocytomation), rabbit anti-pancreatic polypeptide 
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(1:200, Dakocytomation), rabbit anti-Ezh2 (1:100, Epigentek), rabbit anti- PDGFR- 
a (1:50, Novus Biologicals), sheep anti-PDGFR-B (1:100, Sigma), rabbit 
anti-phospho-PDGFR-« (Tyr720) (1:50, Santa Cruz Biotechnologies), rabbit 
anti-phospho-p44/42 MAPK (Erk1/2) (Thr 202/Tyr 204) (1:50, Cell Signaling), 
rabbit anti-phospho-Rb (Ser 780) (1:100, Cell Signaling), rabbit anti-cyclin D1 
(1:50, Cell Signaling), rabbit anti-cyclin D2 (1:50, Cell Signaling), mouse anti- 
PDX1 (1:50; gift from H. Edlund), mouse monoclonal anti-Ki67 (1:100, 
Novocastra) and mouse monoclonal anti-BrdU (1:100, Sigma). We detected 
immune complexes with secondary antibodies conjugated with either Cy3, 
fluorescein isothiocyanate (Jackson ImmunoResearch) or horseradish peroxidase 
(Vector Laboratories). After staining, images were directly collected on an AxioM1 
microscope equipped with a CCD digital camera (Carl Zeiss), or on a Leica SP2 
inverted confocal microscope. 

For measuring pancreatic B-cell mass or BrdU by morphometry, three to six 
mice for each group were analysed. At least five sections separated by more than 
300 tum (pancreases) or 50 kum (cultured islets) were immunostained and assessed 
for each sample. Images were analysed with an ImagePro program by observers 
blinded to genotype, and pancreatic B-cell mass and percentage of BrdU~ B-cells 
were calculated as described previously”. 

Pancreatic islet ChIP analysis. ChIPs were performed using a chromatin immu- 
noprecipitation kit from Millipore as described previously’’. Briefly, freshly iso- 
lated mouse pancreatic islets or cultured human islets after the indicated 
treatments were pooled, and fixed with 2% formaldehyde for cross-linking for 
20 min at room temperature (25 °C). After washing, islets were dissolved in SDS 
lysis buffer containing a proteinase inhibitor cocktail followed by sonication to 
shear the chromatin. Precleared chromatin from 150 to 300 islets was used for 
each ChIP sample with incubation of 1 to 5 1g of the appropriate antibodies at 
4 °C overnight. The antibodies used in the ChIP assays included mouse anti-E2F1 
(Millipore), rabbit anti-E2F4 (Santa Cruz Biotechnologies), rabbit or mouse 
polyclonal IgG (Millipore). After incubation, the immunoprecipitated chromatin 
DNA was harvested, cross-link reversed, and purified. After measurement of 
DNA concentration by PicoGreen DNA assay (Invitrogen), equivalent amounts 
of chromatin DNA from every sample was quantified by real-time qPCR in ABI 
Prism 7300 detection system (Applied Biosystems). The sequences of the PCR 
primers and probes are listed in Supplementary Table 3. 

Human pancreas and islet studies. Human pancreases or islets from organ 
donors were procured by arrangement with the National Disease Resource 
Interchange and the University of Alabama, Birmingham with appropriate consent. 
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Institutional review board approval for research use of tissue was obtained from 
Stanford University School of Medicine. For pancreas studies, fresh human 
pancreases were processed in our laboratory into paraffin sections using standard 
histological protocols. Four pancreases from juvenile donors (1, 2, 3, 7 years old) 
and four pancreases from adult donors (22, 34, 55, 56 years old) were used for 
immunohistology studies here. Similar immunohistology methods and antibodies 
as those used for mouse pancreas tissues were used to detect PDGFR-«, phospho- 
ERK1/2 (pERK1/2), phospho-RB (Ser 780) (pRB(Ser780), EZH2 and insulin on 
human pancreas sections. Experiments were repeated on multiple pancreas sections 
for every donor pancreas and similar results were obtained. 

For islet studies, human islets were isolated either at Pittsburgh University 
Medical Center (R.B.), or the University of Alabama, Birmingham (S. Bryant 
and A. Thompson). Eight independent human islet preparations from juvenile 
donors at the age of 8 months, 1.0, 1.5, 2, 3, 4, 5 and 6 years, and three adult islet 
preparations from donors at the age of 39, 48 and 56 years were used in this study. 
After isolation, fresh islets were shipped to our laboratory, and were hand-picked 
and transferred to RPMI-1640 medium supplemented with 0.5% FBS, 0.2% BSA 
and 2% penicillin/streptomycin for an overnight recovery before experiments. 
Similar to the experiments performed with mouse islets, equilibrated human islets 
were exposed to recombinant human PDGF-AA, without or with addition of 
pharmacological inhibitors, including Sunitinib (Selleck) and U0126 (Sigma) for 
2 days, followed by islet RNA purification for real-time RT-PCR analyses of gene 
expression. To detect in vitro human islet B-cell proliferation stimulated by PDGF- 
AA, human islets were exposed to 50 UM BrdU for 24h. Afterwards, human islets 
were processed in a similar way as in mouse islet studies for immunostaining of 
insulin, glucagon, PDX1 and BrdU. B-Cell proliferation rate was determined by 
quantifying percentage of insulin™ cells with BrdU~. More than 50 islets per group 
were counted for determining cultured human islet B-cell proliferation rate. 
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A mass transfer origin for blue stragglers in NGC 188 
as revealed by half-solar-mass companions 


Aaron M. Geller? & Robert D. Mathieu! 


In open star clusters, where all members formed at about the same 
time, blue straggler stars are typically observed to be brighter and 
bluer than hydrogen-burning main-sequence stars, and therefore 
should already have evolved into giant stars and stellar remnants. 
Correlations between blue straggler frequency and cluster binary 
star fraction’, core mass’ and radial position’ suggest that mass 
transfer or mergers in binary stars dominates the production of 
blue stragglers in open clusters. Analytic models*’, detailed obser- 
vations’ and sophisticated N-body simulations’, however, argue in 
favour of stellar collisions. Here we report that the blue stragglers 
in long-period binaries in the old® (7 x 10°-year) open cluster NGC 
188 have companions with masses of about half a solar mass, with a 
surprisingly narrow mass distribution. This conclusively rules out 
a collisional origin, as the collision hypothesis predicts a companion 
mass distribution with significantly higher masses. Mergers in 
hierarchical triple stars’ are marginally permitted by the data, but 
the observations do not favour this hypothesis. The data are highly 
consistent with a mass transfer origin for the long-period blue 
straggler binaries in NGC 188, in which the companions would be 
white dwarfs of about half a solar mass. 

The NGC 188 blue stragglers have a very high binary frequency 
(76 + 19% for binaries with periods of <10* d; ref. 10). The orbital 
period distribution is remarkable; 12 of the 16 blue straggler binaries 
have periods of order 1,000d, and all but two of the blue straggler 
binaries have periods longer than 100d. The two short-period blue 
straggler binaries show evidence that binary encounters were involved 
in their formation’®. We focus here on the ‘long-period’ blue straggler 
binaries, whose orbital solutions yield periods longer than 100 d. 

In Fig. 1, we show the companion mass distribution for the twelve 
NGC 188 blue straggler binaries with periods of order 1,000 d. The orbital 
solutions are derived from spectroscopic data'’”* obtained in the WIYN 
Open Cluster Study. Because we do not detect the flux from the compa- 
nions to these blue stragglers, the orbital solutions provide mass functions 
rather than mass ratios. We therefore use a statistical algorithm’ to 
convert the mass functions to the companion mass distribution shown 
here (Supplementary Information). The distribution is narrow and 
peaked with a mean of 0.53 solar masses (Ma) and a mode of 0.5Mo. 

Predictions for the companions to blue stragglers resulting from 
mass transfer in solar-type stars are well established by theory. Case- 
C mass transfer (from an asymptotic giant to a main-sequence star) 
leaves a carbon—-oxygen white dwarf companion in an orbit of order 
1,000 d and with a mass of about 0.5M5-0.6M , dictated by the core 
mass of the asymptotic giant donor at the end of the mass transfer 
phase’*””, This prediction is qualitatively reproduced by the NGC 
188 blue straggler companion mass distribution shown in Fig. 1. To 
check quantitatively for consistency, we compare the observed mass 
function distribution (points in Fig. 2) to a theoretical mass function 
distribution derived assuming that all companions have the typical 
carbon-oxygen white dwarf mass, 0.55M.q (solid line in Fig. 2). A 
Kolmogorov-Smirnov test shows that the theoretical and observed 
distributions are indistinguishable. 


Additionally, NGC 188 contains one blue straggler in a binary with 
an orbital period of about 120 d, the companion flux of which we also 
do not detect in our spectra. The observed mass function and orbital 
period for this system are consistent with the theoretical predictions of 
Case-B mass transfer (from a giant to a main-sequence star), where a 
helium white dwarf companion with a mass of about 0.25M,5-0.5Mo 
is expected'*”. Thus, the companions to all long-period blue straggler 
binaries in NGC 188 are consistent with a mass transfer origin. 

We investigate the predicted companion mass distribution from the 
collision hypothesis using a sophisticated N-body model of NGC 188 
that incorporates detailed stellar and binary evolution with stellar 
dynamics, and thereby produces blue stragglers through both colli- 
sions and mass transfer processes (Supplementary Information). The 


100 


80 


60 


40 


Frequency (%) 


° YY fh YY i 
(a0, 4 y 4 

: Yj WU 

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 


Companion mass (M .) 


Figure 1 | Companion mass distribution for the 12 blue straggler binaries in 
NGC 188 with periods of order 1,000d. The distribution is peaked near the 
typical carbon-oxygen white dwarf mass, of 0.55M., consistent with 
theoretical predictions from the Case-C mass transfer hypothesis. We use a 
statistical method’* to derive the companion mass distribution from the 
observed mass functions (Supplementary Information). To do so, we first 
estimate masses for the blue stragglers on the basis of standard stellar 
evolutionary tracks”, and assume an isotropic inclination distribution. The 
shaded histogram shows the resulting companion mass distribution and is 
normalized to show the frequency. The error bars show the 95% confidence 
intervals and are converted from the Poisson uncertainties in the mass function 
distribution using a Monte Carlo analysis. We note that standard evolutionary 
tracks may underestimate'’ the mass of blue stragglers by up to about 15%. 
Accounting for this potential bias does not change the results found here (nor in 
Fig. 2). For comparison, we also plot a standard initial-mass function for single 
stars’? (dotted grey line) for companion masses between 0.08M« and 1.1M.6 
(from the hydrogen-burning limit to the current main-sequence turn-off mass 
in NGC 188). The grey hatched histogram shows the observed tertiary mass 
distribution” evolved to 7 Gyr in isolation”. The error bars on the tertiary mass 
distribution show the standard Poisson counting uncertainties for each bin. 
The lowest-mass bin extends from 0.08M © to 0.2M. and the highest-mass bin 
extends from 1.0Mo to 1.1M.. Both of these bins are renormalized to reflect 
the different bin sizes. 
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Figure 2 | Cumulative distribution of mass functions of the NGC 188 blue 
straggler binaries with periods of order 1,000d. The black points show the 
observed mass functions derived directly from the kinematic orbital solutions’, 
defined as 
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where M, is the mass of the primary star (here the blue straggler), M, is the mass 
of the companion star and i is the inclination of the orbit to our line of sight. 
(Orbits that are edge-on have inclinations of 90°.) To test the blue straggler 
formation hypotheses, we compare this observed distribution with three 
theoretical mass function distributions, all derived using our blue straggler 
mass estimates and assuming isotropically distributed inclination angles. The 
solid line shows the resulting distribution assuming that all companions have 
masses of 0.55Mo, the typical carbon-oxygen white dwarf mass predicted by 
the Case-C mass transfer hypothesis. The dotted line shows the mass function 
distribution of the predicted companion masses for blue straggler binaries 
formed by collisions in the NGC 188 N-body model. Finally, the dashed line 
shows the distribution derived by drawing companion masses from the evolved 
tertiary mass distribution shown in Fig. 1. A Kolmogorov-Smirnov test rules 
out the collisional hypothesis at the >99% confidence level. Such a test does not 
rule out the merger hypothesis. However, there is only a 6.6% chance that all 
companions would be undetected in our spectra if drawn from the evolved 
tertiary mass distribution, and only a 1.8% chance that these companions 
would also realize the observed mass function distribution shown here. The 
observed mass function distribution is statistically indistinguishable from that 
predicted by the mass transfer hypothesis, and all white dwarf companions 
would be undetectable in our spectra, given their low luminosities. 


resulting distributions of companion mass, eccentricity and period for 
the blue stragglers in binaries that respectively formed through the 
collision and mass transfer mechanisms (Fig. 3) show marked differ- 
ences. We focus here on the simulated blue stragglers in binaries with 
periods of 100-3,000 d (matching the period range of our NGC 188 
long-period blue straggler binaries). 

The very narrow distribution and mean mass of (0.58 + 0.01)Mo 
for companions to the mass transfer blue stragglers in the N-body 
model are in good agreement with the population synthesis predic- 
tions discussed above. However, the mean mass of companions to 
collisionally formed blue stragglers is (1.11 + 0.02)Mo, nearly twice 
that of the mass transfer blue stragglers. This result demonstrates the 
finding that dynamical exchanges are more likely to insert a higher- 
mass member of the encounter into the binary’’. 

These differences in companion mass reflect profound differences 
in the evolutionary states of the companions resulting from each pro- 
cess. All of the simulated blue straggler binaries that formed through 
mass transfer processes have white dwarf companions, the remnants of 
the giant donor stars. Fewer than 1% of the blue stragglers in binaries 
that formed through collisions have white dwarf companions, whereas 
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80% have main-sequence companions (and the remaining have giant 
or blue straggler companions). 

A Kolmogorov-Smirnov test rules out at the >99% confidence level 
the hypothesis that the observed mass functions for the long-period 
NGC 188 blue straggler binaries are drawn from a parent population of 
collisional origin (dashed line in Fig. 2). Additionally, collision pro- 
ducts are predicted to have significantly higher eccentricities and 
longer periods than are observed for the NGC 188 blue stragglers (both 
at the >99% confidence level). We therefore rule out the hypothesis 
that the long-period NGC 188 blue straggler binaries have a collisional 
origin. 

Three of the long-period NGC 188 blue stragglers have measured 
orbital eccentricities consistent with circular orbits. By contrast, no 
NGC 188 solar-type main-sequence binaries in this period range have 
circular orbits. These blue straggler circular orbits are suggestive of a 
mass transfer origin. Rapid tidal circularization of the orbit during 
mass transfer has been a long-held expectation, as is seen in the pre- 
dicted eccentricity distribution for mass transfer products in the NGC 
188 model (Fig. 3b). 

However, observational and theoretical evidence suggests that mass 
transfer will not always lead to circular orbits. Proposed “eccentricity- 
pumping mechanisms’ address this issue and are under develop- 
ment'”*". Thus, the theoretical eccentricity distribution for blue strag- 
glers formed by mass transfer is uncertain. 

Fortuitously, the blue stragglers of the Galactic field provide a basis 
for empirical comparison’. Specifically, we compare the NGC 188 
blue stragglers with a blue straggler sample identified within the popu- 
lation of metal-poor thick-disk stars and halo stars, which is found to 
be coeval”’. These field blue stragglers probably formed in isolation, 
presumably through mass transfer processes”. In fact, the field blue 
stragglers are observed to have a high binary frequency, a period 
distribution that peaks near a few 100 to a few 1,000d (dotted line 
in Fig. 3c) and a mean companion mass consistent with 0.55Mo, all 
again consistent with a mass transfer origin. 

These long-period field blue straggler binaries show a range of 
non-circular eccentricities (dotted line in Fig. 3b). Enhancement of 
eccentricity through subsequent dynamical encounters cannot explain 
the non-circular orbits of field blue stragglers, owing to the low stellar 
density of the Galactic field. Therefore, if the long-period binaries 
among the field blue stragglers were formed through mass transfer, 
this is further evidence for the existence of an eccentricity-pumping 
mechanism. 

The eccentricity distribution of the long-period NGC 188 blue straggler 
binaries is shifted to higher eccentricities than that of the long-period 
field blue straggler binaries. However, owing to the higher densities in the 
cluster core, dynamical encounters may have increased the eccentricities 
of the NGC 188 blue stragglers. If we exclude from the analysis blue 
stragglers in NGC 188 within 1.5 core radii from the cluster centre, the 
two eccentricity distributions are statistically indistinguishable. The 
similarity in periods, companion masses and eccentricities would be 
a natural consequence of both the field and the NGC 188 long-period 
blue straggler binaries being formed by mass transfer processes. 

Finally, we investigate the possibility of blue straggler formation 
through mergers in hierarchical triples”**. The potential progenitors 
of the blue stragglers produced by this mechanism would be triples 
with short-period (=10-d) inner binaries’ having a total mass of 
12M5-2.2Mo, (2.2Me is twice the turn-off mass in NGC 188). 
The frequency of dynamically formed triples in the NGC 188 model 
with these orbital parameters is never high enough to contribute con- 
siderably to blue straggler production through this mechanism. If the 
merger mechanism is important, triples must form primordially with 
suitable orbital parameters in cluster environments. 

Observationally, the triple population in open clusters is poorly 
known, but triples are common in the field’®. Field triples in the 
Multiple Star Catalogue*® with measured masses indicating inner 
binaries with masses between 1.2Mq and 2.2Mo and inner orbital 
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Figure 3 | Distributions of binary orbital elements for the blue stragglers in 
the NGC 188 N-body model. a, Companion mass; b, orbital eccentricity; 

c, orbital period. The top plots show blue stragglers formed in collisions, and the 
middle plots show blue stragglers created through mass transfer (MT). The 
bottom plots compare the cumulative distributions for these two populations, 
with collision products shown in light grey and mass transfer products shown 
in dark grey. The sample contains all blue straggler binaries present from 
6-7.5 Gyr in the model, spanning the uncertainty in the cluster age. In a and 
b, we show only blue straggler binaries with periods between 100 and 3,000 d 
(the period range of our NGC 188 long-period blue straggler binaries). In c, we 
include blue straggler binaries of all periods (with the hatched regions 
indicating binaries at periods beyond our detection limit). Each bin is 
normalized by the total number of blue straggler binaries in the sample, and the 


periods of less than 10d have a nearly uniform tertiary mass distri- 
bution populated by main-sequence stars. However, the typical age of 
local field stars is 4 X 10° yr (4 Gyr) (ref. 27). If we evolve the tertiary 
mass distribution to 7 Gyr in isolation”*, 15% of the tertiaries evolve to 
become white dwarfs. The grey hatched histogram in Fig. 1 shows the 
resulting tertiary mass distribution at 7 Gyr. The mass distribution is 
qualitatively broader than that of the companions to the NGC 188 
long-period blue stragglers. However a Kolmogorov-Smirnov test 
comparing the mass function distributions does not rule out the triple 
hypothesis. 

The fact that we do not detect in our spectra the flux from compa- 
nions to any of the long-period blue straggler binaries also constrains 
the companion masses. The higher-mass main-sequence stars in the 
evolved tertiary mass distribution (Fig. 1) would be easily detected if 
these were the true companions to the long-period blue straggler 
binaries in NGC 188. A Monte Carlo analysis yields a 6.6% probability 
that all companions to the long-period NGC 188 blue straggler 
binaries would be undetected in our spectra if drawn from the evolved 
tertiary mass distribution, and only a 1.8% probability that these com- 
panions would also realize the observed mass function distribution of 
the long-period NGC 188 blue straggler binaries (Supplementary 
Information). Thus, mergers in hierarchical triples are not favoured 
by the observations, but the data are not sufficient to rule out this 
hypothesis completely. 

We aim to detect directly the flux from the white dwarf companions 
predicted by the mass transfer mechanism with forthcoming Hubble 
Space Telescope observations of the NGC 188 long-period blue 
straggler binaries in the ultraviolet. These observations will be invalu- 
able for distinguishing between the two remaining formation hypo- 
theses: binary mass transfer and mergers in hierarchical triples. 
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error bars show the Poisson counting uncertainties. Additionally, in the bottom 
panels of b and c we show the cumulative eccentricity (b) and period 

(c) distributions of the blue straggler binaries in NGC 188 (dashed lines) and 
the field’* (dotted lines). The N-body model predicts that collision products will 
have significantly more-massive companions, larger eccentricities and longer 
periods than the blue stragglers in NGC 188. However, the predicted 
companion masses and periods for mass transfer blue stragglers are consistent 
with those of the long-period NGC 188 blue straggler binaries. We note that the 
N-body model does not implement proposed eccentricity-pumping 
mechanisms that are required to reproduce the non-zero eccentricities 
observed for post-mass-transfer systems, including the field blue stragglers. 
Thus, the predicted eccentricities of the mass transfer products are uncertain, 
although they are still likely to be lower than those of the collision products. 
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Biomimetic self-templating supramolecular 


structures 
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& Seung-Wuk Lee’? 


In nature, helical macromolecules such as collagen, chitin and 
cellulose are critical to the morphogenesis and functionality of 
various hierarchically structured materials'’*. During tissue 
formation, these chiral macromolecules are secreted and undergo 
self-templating assembly, a process whereby multiple kinetic 
factors influence the assembly of the incoming building blocks to 
produce non-equilibrium structures’*. A single macromolecule 
can form diverse functional structures when self-templated under 
different conditions. Collagen type I, for instance, forms trans- 
parent corneal tissues from orthogonally aligned nematic fibres’, 
distinctively coloured skin tissues from cholesteric phase fibre 
bundles®’, and mineralized tissues from hierarchically organized 
fibres*. Nature’s self-templated materials surpass the functional 
and structural complexity achievable by current top-down and 
bottom-up fabrication methods’’”. However, self-templating has 
not been thoroughly explored for engineering synthetic materials. 
Here we demonstrate the biomimetic, self-templating assembly of 
chiral colloidal particles (M13 phage) into functional materials. A 
single-step process produces long-range-ordered, supramolecular 
films showing multiple levels of hierarchical organization and 
helical twist. Three distinct supramolecular structures are created 
by this approach: nematic orthogonal twists, cholesteric helical 
ribbons and smectic helicolidal nanofilaments. Both chiral liquid 
crystalline phase transitions and competing interfacial forces at 
the interface are found to be critical factors in determining the 
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Figure 1 | Schematic diagram of the phage-based self-templating process. 
a, Schematic illustration of the phage structure covered by about 2,700 o-helical 
major coat (pVIII) protein subunits having five-fold helical symmetry and a 
two-fold screw rotation axis (red and green arrows). The phage has a helical 
angle (#) of 41° with periodic subunit spacing of 3.3 nm. b, Schematic 


morphology of the templated structures during assembly. The 
resulting materials show distinctive optical and photonic properties, 
functioning as chiral reflector/filters and structural colour matrices. 
In addition, M13 phages with genetically incorporated bio- 
active peptide ligands direct both soft and hard tissue growth in a 
hierarchically organized manner. Our assembly approach provides 
insight into the complexities of hierarchical assembly in nature 
and could be expanded to other chiral molecules to engineer soph- 
isticated functional helical-twisted structures. 

As building blocks for the self-templating process we chose to use 
the bacterial virus M13 phage. Various viral particles have previously 
been used as model liquid crystal systems'*’* and to create functional 
materials for nanoparticle synthesis and assembly’””*, for energy 
storage’? and in tissue engineering”’”’. In particular, M13 was chosen 
for its helical, nanofibrous shape, for its monodispersity and for its 
ability to display multiple functional motifs (Fig. la). We then 
assembled the phage into large-area films by creating an apparatus 
to pull substrates vertically from phage suspensions at precisely con- 
trolled speeds (Supplementary Movie 1). As the substrates were pulled, 
evaporation proceeded fastest near the air—liquid-solid contact line 
resulting in the local accumulation and deposition of phage particles 
on the substrate at the meniscus (Fig. 1b). Two factors were critical to 
self-templating assembly. The first was the local induction of chiral 
liquid-crystal phase transitions at the meniscus; that is, twisted 
nematic (cholesteric) and chiral smectic structures (Supplementary 
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illustration of the helical self-templating assembly of phage particles controlled 
by competing interfacial forces at the meniscus where liquid crystal phase 
transitions occur. The polarized optical microscopy image shows iridescent 
colours originating from liquid crystal phase formation at the air—liquid-solid 
interface. 
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Fig. 1)’*"®. The second was the presence of competing interfacial forces 
acting at the meniscus, forces that determined liquid crystal orienta- 
tion, material flux and higher-order twist. Through the interplay of 
these factors during the film growth process we created novel, chiral, 
liquid crystalline structures, which we termed nematic orthogonal 
twist, cholesteric helical ribbon, and smectic helicoidal nanofilament 
(Fig. 2). 

We first varied the phage concentrations used for self-templating, 
because lyotropic liquid-crystal phase transitions are dependent on 
mesogen concentration. At a low concentration range (0.1-0.2 mg ml ~ ), 
phage particles formed nematically ordered fibre bundles with the 
director parallel to the pulling direction. In addition, periodic changes 
in particle flux to the meniscus due to stick-slip motion produced 
regularly spaced, alternating ridge (stick region) and groove (slip 
region) stripes perpendicular to the pulling direction (Supplemen- 
tary Fig. 2). At this concentration range, chiral phage interactions were 
not yet observed. At a higher concentration range (0.2-1.5 mg ml 3), 
twisted nematic structures began to form at the meniscus, resulting in 
the formation of alternating cholesteric/nematic ridges and grooves 
with pitch Pg throughout the sample during stick-slip motion 
(Fig. 2a, b). These structures, which we termed nematic orthogonal 
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Figure 2 | Self-templated helical supramolecular structures. a, Diagram of 
the nematic orthogonal twist structure. b, Scanning electron microscopy (SEM) 
image of a periodic, striped pattern. c, Atomic force microscopy (AFM) image 
depicting left-handed rotation of phage fibre bundles when progressing from a 
groove to ridge area. d, Diagram of alternating nematic grooves (1, black) and 
cholesteric ridges (2, red). e, Diagram of the cholesteric helical ribbon (CHR) 
structure. f, Photograph of CHR structures showing the curved meniscus 

(dashed red line), which induces the rolling of phage supramolecular structures 
(blue arrows). g, SEM images of the CHR structure on the left and right sides, 
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twist, had twisted nematic planes in which the phage bundle director 
rotated one-quarter of a cholesteric pitch (P/4) in a left-handed 
manner when it progressed from grooves to ridges (Fig. 2c). This 
indicates that the nematic was twisted whenever the meniscus was 
pinned to the substrate. Thus, the phage particles in the grooves aligned 
parallel to the substrate pulling direction, whereas those at the ridge 
tops aligned perpendicular to the pulling direction (Fig. 2d and 
Supplementary Fig. 3). The preferred 90° rotation may have resulted 
from the decrease in liquid crystal elastic energy in response to the 
confined geometry formed by the hybrid boundaries, as has previously 
been observed”. Therefore the nematic orthogonal twist structure was 
induced by the interplay of multiple kinetic factors including flux, 
liquid crystal transitions and elastic forces. 

When we tuned the phage concentration to 0.25-0.5 mg ml _' and 
the pulling speed to 10-20 1m min ', the cholesteric phase that 
formed at the pinned meniscus became noticeably influenced by forces 
originating from the curvature of the meniscus at the outer edges of the 
substrate (Fig. 2e, and dashed red line in Fig. 2f). The meniscus 
appeared as a three-dimensional curved geometric shape at the edges 
but as a two-dimensional shape in the middle. Because evaporation 
proceeded faster in the three-dimensional regions, liquid crystal phase 
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which show right and left handedness, respectively. h, AFM image showing that 
the CHR is composed of twisted cholesteric phase (red dashed lines indicate the 
cholesteric arch morphology). i, Diagram of the smectic helicoidal 
nanofilament (SHN) structure. j, k, AFM images of the SHN structure 

(j), which is composed of smectic C bundles (k). 1, Proposed model of the SHN 
structure composed of left-handed and right-handed helicoidal nanofilaments. 
m, Grazing-incidence small-angle X-ray scattering measurement 
perpendicular to the pulling direction, showing pseudo-hexagonally packed 
phage structures within the SHN. 
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transitions also occurred sooner in the same regions. As a result, these 
sites initiated macroscopic rolling of cholesteric phases that propa- 
gated by self-templating towards the centre of the substrate to form 
a ribbon-like structure, which we termed cholesteric helical ribbon 
(CHR) (Fig. 2f, g; the proposed mechanism of CHR formation is 
shown in Supplementary Fig. 4a). The rolling of the cholesteric axis 
led to structures that are distinctly different from previously identified 
doubly or triply twisted phases****. The underlying cholesteric struc- 
ture was observed as arches on the surface of the CHRs (Fig. 2h). Such 
arch textures are commonly observed in oblique cross-sections of 
cholesteric structures**”®. As a result of the mirror symmetry of the 
meniscus, the left side of the film formed a right-handed CHR (Fig. 2g, 
left side), whereas the right side formed a left-handed CHR (Fig. 2g, 
right side). The opposite-handed ribbons met in the middle to form 
grain boundary structures (Supplementary Fig. 4b-e). The CHR films 
showed higher levels of periodic order that can be characterized by 
additional pitches (Fig. 2g, h): a pitch P from the embedded cholesteric 
structure and a pitch P,,,, describing the supramolecular rotation. In 
addition, CHRs formed with periodic spacing Py. Although the phage 
particles are assembled with a left-handed cholesteric twist, this twist 
did not seem to alter the macroscopic rotation of the CHRs because 
there was no statistically significant difference between the pitches 
(Psup) of left-handed and right-handed CHRs (123.1 + 38.2 and 
128.1 + 36.1 um, respectively). 

In the concentration range 4-6 mg ml _', the deposited films formed 
periodic structures with even more levels of hierarchical order. 
Microstructural analyses of the films grown at 6mgml ' showed 
‘ramen-noodle-like’ structures, which we termed smectic helicoidal 
nanofilaments (SHNs) (Fig. 2i, j). The SHNs had layered, zig-zag 
morphologies with periodic pitch P,,, and alternating angles 0; and 
0,'. Meanwhile, tilted (02) smectic C bundles with fibre diameter D 
were stacked to form the length of each SHN (Fig. 2j, k and 
Supplementary Fig. 5). When comparing 0, and 0,’ values for the left 
and right sides of the film, 0; <0,’ on the left side of the film but 
0,>0,' on the right side (Supplementary Fig. 6). In addition, the 
layers with the larger angle were wider by about 40%, resulting in 
oppositely polarized grating structures across the grain boundary. 
We believe that these side-dependent differences are indicative of 
opposite-handed helical rotations of the nanofilaments caused by 
the opposing meniscus forces acting from the outer edges towards 
the centre (Fig. 21), as they did for the CHRs (details discussed in 
optical characterization below). When analysed by grazing-incidence 
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Figure 3 | Optical properties of the self-templated supramolecular 
structures. a, Polarized optical microscopy image (cross-polar) of the grain 
boundary (white dashed line) observed in a SHN structure. Oppositely 
polarized optical responses from the left and right sides of the film are visible. 
Inset: iridescence on the film. b, SHN films on a gold substrate constructed at 
different pulling speeds showing distinct colours. At faster pulling speeds, 
narrower phage fibres were formed (left, AFM images; right, schematic 
diagrams) with a shorter-wavelength colour. c, Phage films with alternating 
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small-angle X-ray scattering, SHNs showed pseudo-hexagonal packed 
crystalline structures exhibiting (100), (110) and (200) interparticle 
spacing peaks (Fig. 2m and Supplementary Fig. 7). 

All of the self-templated structures shown in Fig. 2 were tunable by 
varying parameters that affect the kinetics and thermodynamics of 
assembly such as phage concentration, pulling speed, ionic concentra- 
tion, phage surface chemistry, and substrate surface properties. For 
examples, the width, height and interspacing of the stick-slip patterns 
increased with increasing phage concentration, whereas increasing the 
pulling speed caused the height and interspacing to decrease (Sup- 
plementary Fig. 8). Variations in each parameter affected the meniscus 
forces, the friction forces and the meniscus shapes. These forces in turn 
induced the formation of a variety of chiral self-templated structures. 
The results also show that the higher-order handedness of self- 
templated structures can be controlled by external perturbations 
regardless of the inherent homochirality of the building blocks. 

Inspired by hierarchical structures in nature with characteristic 
colours and exquisite optical properties, such as blue-faced monkeys®, 
bird skins’ and beetle exocuticles”’, we investigated the optical and 
photonic properties of the self-templated structures. As discussed 
previously, SHN films pulled at slow speeds (about 15 um min’) have 
ordered, periodic grid-like structures (Fig. 3a). When spot-illuminated 
with white light, these films showed iridescence (Fig. 3a, inset). The 
oppositely polarized grating structure in these films suggested that the 
filament handedness was different on the left and right sides (Fig. 21 
and Supplementary Fig. 6). The sides did indeed have opposite optical 
responses when analysed by polarized optical microscopy (Fig. 3a and 
Supplementary Fig. 9), laser diffraction (Supplementary Fig. 10) and 
reflectance measurements (Supplementary Fig. 11), further supporting 
the notion that side-dependent handedness occurred in these films. 
Films fabricated with faster pulling rates (20-80 um min ~ ") also 
formed helical nanofilaments but with less lateral order, which resulted 
in distinctly coloured structures (Fig. 3b and Supplementary Fig. 12). 
Reflectance measurements showed that the characteristic optical 
spectra were correlated with the observed colours and that an increase 
in the pulling speed from 50,1m min ' to 80;1m min shifted the 
primary reflection wavelength position from 490 nm to 388 nm (Sup- 
plementary Fig. 13a). Meanwhile, atomic force microscopy (AFM) 
imaging revealed that as pulling speed increased, bundle diameter and 
interspacing decreased (Fig. 3b, right). Subsequent fast Fourier trans- 
formation analyses of the AFM images revealed that the observed colours 
were attributable to coherent scattering by the different periodicity of 
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bands of aligned microstructures (dashed arrows), showing orthogonal light 
switching responses. Photographs of the phage films deposited with alternating 
slow (20 pm min‘) and fast (1 mm min!) speeds on gold (top two images) 
and glass (bottom two images) substrates. On gold, reflection of incident light 
was selectively ‘on’ or ‘off depending on the direction of illumination (orange 
arrows). On glass, the films selectively filtered the three letters CAL underneath, 
depending on the direction of illumination. 


©2011 Macmillan Publishers Limited. All rights reserved 


each fibrous structure (Supplementary Fig. 13b). Similar structural 
colour generation often occurs in mammalian and avian skins®’. 
This mechanism differs from traditional liquid-crystal-based colora- 
tion, which relies on specific scattering from the cholesteric pitch”. 
Finally, by alternating the speed of film growth between slow and fast 
we made optical filters composed of alternating bands of SHN and 
nematically aligned structures. In both regions the phage particles were 
primarily oriented parallel to the pulling direction, but the SHN 
regions possessed additional perpendicular order in the form of a 
layered microstructure (Supplementary Fig. 14). The orthogonal 
microscale alignments caused selective reflection or transmission of 
visible light, depending on the direction of illumination (Fig. 3c). The 
results from our optical characterization demonstrate the potential of 
using a self-templating approach to construct tunable optical materials. 

Hierarchically organized extracellular matrices are crucial for con- 
trolling the formation of soft and hard tissues because they provide 
physical and biochemical cues to cells and act as templates for bio- 
mineralization’’, We therefore examined whether the structural 
hierarchy of our materials could have similar functions. We first 
engineered phages to display either RGD or EEEE peptides on their 
major coat proteins. The RGD motif promotes integrin-mediated cell 
adhesion, whereas negatively charged motifs mediate bone mineral 
formation in vivo and in vitro”*'. Using RGD-displaying phages, 
we fabricated films having the same alternating SHN and nematic 
hierarchical structures as our optical filters to act as tissue-guiding 
matrices (Fig. 4a). Cells cultured on these films recognized the under- 
lying microstructures, aligning their bodies and internal actin filament 
networks parallel to the long axes of the phage fibres on nematic 


Fast 


Figure 4 | Growth of biomimetic soft and hard tissue on the self-templated 
structures. a, Phase-contrast optical microscopy image of preosteoblast cells 
(MC3T3-E1) cultured on RGD phage films with SHN (top, slow pulling) and 
nematically ordered (bottom, fast pulling) structures. Insets: AFM images 
showing the directional guidance of cellular growth in response to the 
orientation of the underlying film microstructure. b, Composite fluorescence 
image showing that the pre-osteoblast cells (MC-3T3 E1) grew to confluence 
(day 3) in the desired directions on the RGD phage films. This image was 
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regions, but perpendicular to them on SHN regions (Fig. 4a, b and 
Supplementary Fig. 15; details in Supplementary Methods). Unlike the 
cell alignment induced by previous unidirectionally aligned tissue 
matrices (such as collagen, synthetic polymers or viruses)””?'***’, the 
structural ordering of the self-templated films guided cells in multiple 
directions in a hierarchically organized manner. Furthermore, to 
mimic the hierarchical formation of hard tissue, SHN films (RGD 
phages and EEEE phages (1:1 mixture)) (Fig. 4c) were treated with 
an apatite precursor solution containing Ca** and PO,’ ions. The 
films acted as a template for the biomineralization of calcium phos- 
phate, resulting in tooth-enamel-like organic-inorganic composites 
(Fig. 4d; details in Supplementary Methods). The Young’s modulus 
of the phage films was significantly increased (about 20-fold) after 
mineralization (Fig. 4e and Supplementary Fig. 16). Thus we reca- 
pitulated aspects of tissue formation by using self-templated materials 
composed of genetically engineered phages displaying specific func- 
tional motifs. 

We have shown that aspects of nature’s self-templating assembly 
strategy could be translated to a synthetic system through the con- 
trolled growth of phage-based films. Our system provides insight into 
how environmental factors control the kinetics during the conversion 
of helical macromolecules into higher-ordered structures in nature 
and how these same factors can be used to control assembly in the 
laboratory. Specifically, sub-micrometre-scale order was derived from 
liquid crystal phase transitions and their chiral twist; micrometre- 
scale order arose from forces associated with geometric constraints (con- 
fined spaces and curved menisci) and by periodic modulation of material 
flux (phage concentration and pulling speed); and centimetre-scale 
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merged from two fluorescence images taken at two different focal planes. 

c, SEM image of an RGD and EEEE phage (1:1 mixture)-based SHN surface 
before biomineralization. d, SEM image of the hard-tissue-like composite 
material mineralized with Ca** and PO,’ solutions. Blue overlays show the 
zig-zag structures of the composite, indicating that the hierarchical structures 
were preserved after biomineralization. e, Increase in the Young’s modulus of 
the phage films (about 20-fold) after biomineralization. Error bars indicate s.d. 
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order was induced by controlled meniscus movement. The power and 
versatility of self-templating is apparent, considering that a simple 
fabrication process produced an array of complex, functional struc- 
tures with many levels of hierarchical order from a single type of viral 
building block. Further development and tuning of synthetic self- 
templating systems hold promise for the development of advanced 
structural, optical and biomedical materials. 


METHODS SUMMARY 


Self-templating film fabrication. A home-built phage deposition apparatus was 
constructed by modifying a syringe pump. A C++ program was created to 
manipulate the motor speed between 0.1 tm min |! and 30mm min‘! controlled 
through an RS232C cable. Varying phage concentrations, ionic concentrations, 
pulling speeds and substrate surface chemistries were used for film fabrication. 
Gold-coated silicon substrates were used unless otherwise described. 

Genetic engineering of phages. M13 phages were genetically engineered to dis- 
play specific peptide motifs on their major coat proteins (pVIII). Target peptides 
were inserted near the amino terminus of pVIII. The constructed phage DNA 
sequences were verified at the University of California, Berkeley DNA sequencing 
facility. 
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Aerobic bacterial pyrite oxidation and acid rock 
drainage during the Great Oxidation Event 
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Olivier J. Rouxel**, Mark E. Barley®, Carlos Rosiere’, Phillip W. Fralick®, Lee R. Kump? & Andrey Bekker!? 


The enrichment of redox-sensitive trace metals in ancient marine 
sedimentary rocks has been used to determine the timing of the 
oxidation of the Earth’s land surface”. Chromium (Cr) is among 
the emerging proxies for tracking the effects of atmospheric oxy- 
genation on continental weathering; this is because its supply to the 
oceans is dominated by terrestrial processes that can be recorded in 
the Cr isotope composition of Precambrian iron formations’. 
However, the factors controlling past and present seawater Cr iso- 
tope composition are poorly understood. Here we provide an inde- 
pendent and complementary record of marine Cr supply, in the 
form of Cr concentrations and authigenic enrichment in iron-rich 
sedimentary rocks. Our data suggest that Cr was largely immobile 
on land until around 2.48 Gyr ago, but within the 160 Myr that 
followed—and synchronous with independent evidence for oxy- 
genation associated with the Great Oxidation Event (see, for 
example, refs 4-6)—marked excursions in Cr content and Cr/Ti 
ratios indicate that Cr was solubilized at a scale unrivalled in his- 
tory. As Cr isotope fractionations at that time were muted, Cr must 
have been mobilized predominantly in reduced, Cr(m), form. We 
demonstrate that only the oxidation of an abundant and previously 
stable crustal pyrite reservoir by aerobic-respiring, chemolithoau- 
totrophic bacteria could have generated the degree of acidity 
required to solubilize Cr(1m) from ultramafic source rocks and 
residual soils’. This profound shift in weathering regimes begin- 
ning at 2.48 Gyr ago constitutes the earliest known geochemical 
evidence for acidophilic aerobes and the resulting acid rock drain- 
age, and accounts for independent evidence of an increased supply 
of dissolved sulphate* and sulphide-hosted trace elements to the 
oceans around that time’’. Our model adds to amassing evidence 
that the Archaean-Palaeoproterozoic boundary was marked by a 
substantial shift in terrestrial geochemistry and biology. 

Most contemporary seawater chromium is sourced from the dis- 
solution of chromite [(Fe,Mg)Cr(im),O,4] in ultramafic rocks and 
soils'®. Cr(v1) is a mobile phase under neutral to alkaline conditions, 
but the only naturally important oxidants of dissolved Cr(!) at pH < 9 
are Mn(11)/Mn(rv)-oxides"’. Instead, most solubilized Cr(11) is rapidly 
re-precipitated as (Fe,Cr)(OH)3 at pH values >6, and so Cr tends to 
accumulate in modern soils as a Cr(i) residue"’; at lower pH, soluble 
Cr(im) species dominate, even at high redox potential’. In modern 
oceans, the chromate (v1) anion, CrO,”, is reduced to Cr(m) in the 
sediment via the action of Cr(v1)-reducing bacteria’’ and abiotically 
through reaction with Fe** or HS” (ref. 14). During much of the 
Precambrian, similar reductive reactions would have occurred within 
the ocean water column, owing to the widespread development of 
ferruginous and/or euxinic conditions'*"’. 

Given the efficient reduction of Cr(v1) and its subsequent quanti- 
tative removal as Cr(t11), it was suggested that the isotopic composition 


of Cr in Precambrian sediments should reflect the magnitude of 
oxidative weathering of Cr(m) on land and the flux of Cr(v1) to the 
ancient oceans*. Motivated by this suggestion, we have compiled a 
database of Cr concentration in Precambrian iron formations (banded 
iron formations, BIFs, and granular iron formations, GIFs) and in 
Phanerozoic ironstones and hydrothermal precipitates (Fig. 1a; 
Supplementary Fig. 1; Supplementary Table 1). As Cr can be associated 
with detrital input because of its general insolubility, we evaluate 
authigenic Cr enrichment on the basis of Cr/Ti ratios normalized to 
evolving crustal values (Fig. 1b). 

Our data show low Cr concentrations and a lack of large authigenic 
enrichment throughout much of the Archaean, although two excep- 
tions exist. The first, a newly analysed sample from the 3.7-3.8-Gyr- 
old Isua BIF in Greenland with high Cr content, is likely to be related to 
Cr enrichment accompanying detrital chromite contamination. The 
second, a cluster of Algoma-type iron formations about 2.8-2.7 Gyr 
old, show high Cr concentrations and non-negligible authigenic Cr 
enrichment. These Algoma-type iron formations formed close to sub- 
marine volcanic arcs and spreading centres, and represent proximal 
exhalative deposits. Furthermore, some of them are associated with 
iron ore bodies indicative of significant post-depositional hydro- 
thermal and oxidative alteration. Such caveats suggest that these 
Algoma-type iron formations cannot be used to infer Cr inputs from 
continental weathering. 

By contrast, laterally extensive Superior-type iron formation 
deposits developed in near-shore shelf environments (see, for example, 
ref. 17) and are well suited to track continental Cr fluxes. Superior-type 
iron formations show minimal Cr concentrations and enrichments 
throughout the Archaean, but around 2.48 Gyr ago values become 
elevated in near-shore facies from the Penge and Kuruman formations 
(100-270 p.p.m. Cr, Cr/Ti enrichments up to 6X crustal values). The 
majority of deeper-water, clastic-material-free facies from the same 
iron formations show Cr concentrations <50 p.p.m. and Cr/Ti ratios 
within 3 of crustal values. Between 2.45 and 2.43 Gyr ago, the Weeli 
Wolli and the Caué formations show more pronounced Cr concentra- 
tions and enrichments. Values then peak at 2.32 Gyr ago in the 
Timeball Hill Formation (>500p.p.m. Cr and Cr/Ti ratios up to 
10,000 crustal values). 

Not all iron formations in this time window show elevated Cr con- 
centrations or enrichments (for example, the Dales Gorge and Joffre 
members of the Brockman Iron Formation). Considering the low 
solubility of Cr(m) at marine pH (Supplementary Fig. 2), rapid reduc- 
tion of Cr(v1), and near instantaneous co-precipitation of Cr(1m) with 
ferric oxyhydroxide (Supplementary Fig. 3), Cr dispersal would be 
limited upon delivery to the oceans. In this regard, proximity of the 
depositional site to shore played a strong part in determining which 
iron formations would record a continental Cr input (Supplementary 
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Figure 1 | Chromium in iron-rich sedimentary rocks through time. 


a-c, Presented are secular trends in Cr concentrations (a), authigenic 
enrichment in Cr relative to Ti (b), and Cr isotope values replotted from ref. 3 
(c). Ina and b, squares denote bulk analyses, circles are laser ablation analyses, 
black represents Algoma-type iron formations intimately related to submarine 
volcanism, red denotes Superior-type iron formations that formed in 
continental shelf environments, blue indicates Proterozoic oolitic iron 
formations and Phanerozoic ironstones that formed in shallow marine waters, 
and green represents Phanerozoic hydrothermal and exhalative deposits. 
Increased Cr concentrations and authigenic enrichments between 2.48 and 
2.32 Gyr ago are related to increased Cr(iI) inputs to sea water, and correspond 
to intense acidity generated by oxidative weathering of an untapped crustal 
pyrite reservoir. In b, Cr/Ti ratios have been normalized to the evolving Cr/Ti 
ratio of upper continental crust (solid line) according to the restoration model 
of ref. 32. In ¢, solid and dashed lines represent the mean and 20 values, 
respectively, for Archaean and Palaeoproterozoic Cr isotope compositions 
from ref. 3. Significant positive Cr isotope fractionations in the Neoproterozoic 
indicate oxic Cr(v1) cycling and contrast with muted values in older iron 
formations representing a Cr cycle dominated by Cr(im). 


Information section 1). Indeed, iron formations with some of the 
highest Cr values, such as the Caué and Timeball Hill formations, show 
evidence of sediment re-working and grade into GIF and carbonate- 
facies, indicative of shallow-water deposition'*’. The oolitic Timeball 
Hill Formation (Supplementary Information section 2) records the 
highest degree of Cr enrichment in Palaeoproterozoic iron formations, 
which, according to our model, is not surprising considering its shallow 
water (pro-deltaic to offshore) deposition and unique position as a key 
formation in the chronology of the Great Oxidation Event (GOE)*. It 
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is important to note that similarly shallow-water Superior-type iron 
formations from the Archaean—such as in the 3.25-Gyr-old Fig Tree 
Group, the 2.96-Gyr-old Pongola Supergroup and the 2.71-2.70-Gyr- 
old Beardmore-Geraldton Greenstone Belt—all have minimal enrich- 
ment, indicating limited continental supply of authigenic Cr before 
2.48 Gyr ago. 

After 2.32 Gyr ago, most iron-rich sedimentary rocks have high Cr 
concentrations but waning authigenic enrichments consistent with 
increasingly prevalent detrital contribution. These observations are 
confirmed by Cr-Ti-Al systematics, which show that Cr enrichments 
are highest between 2.48 and 2.32 Gyr ago and are maintained regard- 
less of detrital input, whereas iron-rich sedimentary rocks younger 
than 2.32 Gyr ago typically have markedly higher Al,O; content and 
Cr/Ti ratios approaching that of the average continental crust (Fig. 2). 
Time-binned means and 90th percentile values attest to the statistical 
significance of the observations outlined above (Supplementary Fig. 
1c-f); considering all available data, the mean Cr/Ti ratio for the period 
2.48-2.32 Gyr ago is more than an order of magnitude greater than 
that for any other period, significant to over 10 standard errors. Data 
averaged by source and unit, time-binned averages, and a schematic 
diagram summarizing different Cr inputs with time and changing iron 
formation depositional environments, are all given in the Sup- 
plementary Information (Supplementary Figs 1 and 4). 

Authigenic Cr enrichments 2.48-2.32 Gyr ago indicate that a mech- 
anism was in place that could supply Cr from insoluble chromite on 
land. On the basis of a small positive Cr isotope excursion ~2.7 Gyr 
ago, Frei et al.* argued for the initiation of an oxidative weathering 
cycle that included the reaction of O, with Mn(i1) to form the MnO, 
catalyst required for Cr(1m) oxidation and Cr(v1) supply to the oceans. 
However, the high partial pressure of O2 required for Mn(11) oxidation 
at circumneutral pH (for example, ref. 20) strongly contradicts the 
preservation of sulphur isotope mass-independent fractionations 
(S-MIF) in the Archaean (for example, ref 21). Furthermore, the 
Archaean and Palaeoproterozoic iron formation Cr isotope values 
(0.28% to -0.26%p) fall almost entirely within two standard deviations 
about a mean of —0.09%o; this muted signal is in a strong contrast to 
Neoproterozoic values that reach up to +4.9%o and are more indi- 
cative of oxic Cr mobilization (Fig. 1c). The ~2.7-Gyr-old iron for- 
mations that show positive Cr isotope values (for example, Boston, 
Temagami and Cherry Creek Metasupracrustal Suite) are Algoma- 
type, closely linked to submarine volcanism and hydrothermal activity. 
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Figure 2 | Cr-Ti-Al systematics support the evolving nature of the marine 
Cr supply with time. In the Archaean (black circles), Algoma-type iron 
formations are closely linked to submarine volcanism and received limited 
clastic inputs (low Al,O3). In the period 2.5-2.3 Gyr ago (red circles), molar Cr/ 
Tiratios in iron formations significantly exceed crustal values, for both samples 
with high Al,O; (for example, Timeball Hill Formation) and low Al,O; (for 
example, Caué Formation). We propose that this period represents 
unparalleled Cr solubilization on land and authigenic Cr supply to the oceans 
coincident with the onset of oxidative weathering. After 2.32 Gyr (blue circles), 
molar Cr/Ti ratios approach modern crustal values and indicate a stronger role 
for clastic Cr input. Oxidative mobilization of Cr(v1) in the Neoproterozoic and 
Phanerozoic are reflected only by mild Cr/Ti enrichments in samples with low 
Al,O3 contents. 
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Their rapid precipitation from fluids in disequilibrium may lead to 
amplification of small non-redox-dependent kinetic isotope effects, 
making such iron formations poorly suited to resolving atmospheric 
oxygenation. In addition, there are a number of difficulties with invok- 
ing an oxidized Mn intermediate and an increased supply of riverine 
Cr(v1) to sea water in the Archaean (see Supplementary Information 
section 3). 

Although we agree that authigenic Cr enrichment in the early 
Palaeoproterozoic is linked to oxidative weathering, we propose an 
alternative model that does not require Cr(t) oxidation. It has been 
suggested that oxidative weathering of sulphide minerals (for example, 
pyrite) on land ~2.5 Gyr ago led to transiently increased delivery of 
chalcophilic trace elements and sulphate to the oceans’*. Yet what has 
generally been overlooked is that these reactions also generate signifi- 
cant acidity. In the Archaean, increased atmospheric partial pressure 
of CO, may already have acidified surface waters to pH values around 
4.7 (ref. 8), which would only be exacerbated by the acid generated by 
pyrite oxidation during the GOE”. That acid attack would have 
enhanced in situ dissolution of parent ultramafic/mafic material or 
Cr-bearing soil minerals that previously retained Cr under anoxic 
conditions, leading to increased marine Cr(1) supply: this increased 
supply could have been in dissolved form (CrOH*~ or Cr**, depend- 
ing on ambient pH), as Cr-enriched weathering products (for example, 
reactive clays and colloids), or perhaps, complexed with organic matter. 
Even today, Cr(1) can dominate riverine flux, and total riverine Cr flux 
increases with decreasing river pH (Supplementary Fig. 5). We suggest 
that the Cr enrichment beginning 2.48 Gyr ago reflects Earth’s first 
acid rock drainage, where acidity was generated with rising O2 at 
unprecedented scales through the oxidation of a previously untapped 
terrestrial pyrite reservoir. This process continued until the source of 
easily oxidizable pyrite diminished as the upper continental crust fully 
equilibrated with oxidized fluids. 

In modern oxic soils, pyrite oxidation and the generation of acid 
rock drainage is only efficient with Fe(1I) as the oxidant (see, for 
example, ref. 23). Accordingly, it is the ferrous iron to ferric iron 
oxidation reaction that is the rate-limiting step**; rate calculations 
show that in the absence of Fe() and at an atmospheric partial pres- 
sure of 0, <10 ° times the present atmospheric levels (Late Archaean 
estimate), inorganic pyrite oxidation is prohibitively slow, of the order 
of tens of thousands of years or greater for the dissolution of a 100 m° 
pyrite cube (Supplementary Information section 4). Today, pyrite 
oxidation is dependent on aerobic respiration by chemolithoauto- 
trophic bacteria capable of oxidizing Fe(11), as well as reduced sulphur, 
at O, concentrations <0.006 UM (ref 25). Thriving at low pH, they 
accelerate pyrite oxidation by several orders of magnitude and are 
wholly responsible for making pyrite oxidation self-propagating 
through concomitant production of dissolved Fe(11) as pH values drop 
below the threshold for inorganic Fe() oxidation (see, for example, 
ref. 26). Moreover, as very little energy is generated in the oxidation of 
ferrous to ferric iron, these bacteria must oxidize large amounts of 
Fe(11) in order to grow, and even a small number of cells can be 
responsible for generating significant acid rock drainage. Crucially, 
the acid that the bacteria produce drives weathering solutions to pH 
values below 4.5, at which point (Fe,Cr)(OH)3 becomes increasingly 
soluble. 

Cr(u) enrichments in iron formations may be used to constrain the 
pH of surface weathering solutions via a simple mixing model (Fig. 3). 
The model assumes that: (1) all Cr was sourced from a Cr-bearing 
weathering leachate mixed with river water, and precipitates quantita- 
tively on mixing with seawater; and (2) all Fe was sourced from sea 
water. Assumption (1) is well supported by the strong pH dependence 
and rapid kinetics of Cr(im) precipitation (Supplementary Figs 2 and 3) 
and is conservative because we assume no loss of Cr during transport. 
Assumption (2) is also conservative, in that any Fe addition from river 
water only increases the required acidity for a given river/marine 
mixing ratio. The mixing model confirms that shallow deposits such 
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Figure 3 | River/marine mixing model, indicating alteration pH necessary 
to supply sufficient Cr to account for Cr enrichment in the Timeball Hill 
Formation. The model is conservatively based on the lower end of Cr 
enrichment reported for the Timeball Hill Formation, specifically mean Cr 
(197 p.p.m.) and Fe (47 wt%) values”’. Black lines, mixing trajectories for Fe- 
bearing marine waters contributing a total of 10 p.p.m. Fe, and river water 
bearing different proportions of Cr leachate, in which Cr concentration is 
limited as a function of pH by amorphous Cr(OH); solubility according to 
Supplementary Fig. 2a. Light blue shaded area, plausible leachate contributions 
and mixing ratios that are realistic for the deposition of Timeball Hill 
Formation (Supplementary Information section 1.5). The green to yellow 
transition indicates the dependence of inorganic pyrite oxidation on pH, and 
the approximate pH threshold below which aerobic microbial Fe(11) oxidation 
is required to sustain pyrite oxidation. The mixing model confirms that aerobic 
microbial Fe(11) oxidation was required to account for the low alteration pH and 
Cr mobilization even for Timeball Hill Formation samples with the lowest 
degrees of Cr enrichment: our own measured Cr values are higher and thus 
push the lines to more acidic values. Lower limit for marine water was estimated 
by considering the reservoir size of continental runoff relative to the modern 
marine reservoir (data from ref. 33). 


as the Timeball Hill Formation should have been highly sensitive to the 
introduction of even small quantities of acidic, Cr-bearing continental 
fluids. As neither inorganic pyrite oxidation nor CO -driven weath- 
ering can produce levels of acidity below pH ~4.5 (see Supplementary 
Information section 4), Cr enrichment in iron formations 2.48- 
2.32 Gyr ago thus necessarily records acidic pyrite oxidation driven 
by the activity of aerobic Fe(1!)-oxidizing bacteria, and constitutes 
strong evidence for their emergence on land by that time. This sequence 
of events agrees well with molecular studies suggesting that acidophilic 
chemolithoautotrophs diverged from the last common ancestor of «-, 
B- and y-Proteobacteria around 2.5 Gyr ago”’. 

Apart from our Cr record in iron formations, independent geo- 
chemical constraints support a change in continental weathering by 
the early Palaeoproterozoic. For instance, despite similar parent 
lithologies, the 2.7-Gyr Mt Roe palaeosol has a Cr concentration more 
than five times greater than the 2.45-Gyr-old Denison palaeosol (Sup- 
plementary Table 2); the former also has higher and more homogen- 
ous Cr/Ti ratios, indicating a more conservative Cr behaviour. Both are 
characterized by Fe loss: the older example reflecting formation under 
an anoxic atmosphere”, and the younger, by our model, pointing to 
acidity associated with pyrite oxidation that would have prevented 
Fe(1I1) precipitation even if some oxygen was present. Additional evid- 
ence for intensification of chemical weathering on land is given by 
elevated values of the chemical index of alteration (CIA) and high 
Al/Ti ratios in early Palaeoproterozoic shales (Supplementary Informa- 
tion section 5). Indeed, the shales of the Timeball Hill Formation, 
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which shows the highest Cr enrichments in this study, have CIA values 
ranging from 97.9 to 99.7 (ref. 29). 

The Cr isotope record in iron formations (ref. 3) provides key evid- 
ence for our proposed model of Cr(im) transport during the GOE. 
Cr(m1) oxidation in the Archaean and early Palaeoproterozoic, if a 
factor, was unlikely to have been quantitative because of the still low 
partial pressure of O2 and high O2 demand for Mn(1) oxidation. As a 
result, large Cr isotope fractionations (of the order of the maximum 
theoretical fractionation associated with Cr(i) oxidation, up to 7%bo; 
ref. 30) would be expected in the dissolved pool if it was dominated by 
oxidized Cr(v1). However, minimal Cr isotope fractionation in iron 
formations during that time, despite pronounced Cr enrichments, 
requires transport of an isotopically unfractionated Cr load, which 
could only be igneous-rock-derived Cr(im). Iron formations in the 
Neoproterozoic provide a strong contrast to those in the Archaean 
and Palaeoproterozoic. Values of 8°*Cr in the ~715-Myr-old Rapitan 
Formation and the ~700-Myr-old Chuos Formation exceed Archaean 
and Palaeoproterozoic 5°°Cr values by a factor of three; by 560 Myr 
ago, the Yerbal Formation has 8°°Cr values more than 15 times higher, 
approaching 5%o, as expected for a Cr(v1)-dominated marine supply’. 
The most parsimonious explanation for the discrepancy in Cr systematics 
between Palaeoproterozoic and Neoproterozoic iron formations is a 
shift from Oy levels sufficient for the oxidation of some highly redox- 
sensitive elements (such as S and Fe) anda marine Cr supply dominated 
by Cr(im), to more oxidizing conditions permitting widespread Mn(II) 
and Cr(i) oxidation, accompanied by an increased role for Cr(v1). 
Authigenic Cr enrichments in the Neoproterozoic and Phanerozoic 
remain well below those observed at ~2.48-2.32 Gyr ago, despite oxic 
Cr(v1) input; we emphasize that the Palaeoproterozoic Cr excursion 
should be unique because of the ‘switch’ from anoxic to oxic regimes 
of terrestrial pyrite weathering. 

The present work has important implications for the timing, dura- 
tion and sequence of events associated with the GOE. It reveals the 
GOE as a protracted process, first expressed in the early signs of 
oxidative weathering manifested in our Cr record, as well as other 
proxies'*”, and culminating at 2.32 Gyr ago with the permanent loss 
of the S-MIF signal*. The evolution and activity of microbes is intimately 
linked with the redox evolution of the Earth’s surface, but as demon- 
strated here, sometimes in unexpected ways. Redox-sensitive elemental 
proxies for the rise of atmospheric oxygen need to be considered in the 
light of a wide gamut of possible biogeochemical influences beyond 
simple oxidation because, paradoxically, it is Cr cycling in the reduced 
form that tracks the onset of oxidative weathering. Intriguingly, the 
ability of cyanobacteria in the ocean’s photic zones to influence their 
own nutrient supply via O2-enhanced chemical weathering may rep- 
resent an unrecognized positive biological feedback. Moreover, unlike 
carbonic-acid-driven weathering, the increased delivery of phosphate 
from sulphuric acid weathering of apatite would be unaccompanied by 
increased alkalinity. This decoupling of nutrient and alkalinity fluxes 
would drive a proportional increase in organic carbon burial, a neces- 
sary condition for the generation of positive carbon isotope excursions 
(see, for example, ref. 31). The ~2.25-2.10-Gyr Lomagundi carbon 
isotope excursion may, together with the Cr enrichment documented 
here, be a manifestation of sulphuric acid driven weathering produced 
during the initial sedimentary cycle of oxidative weathering”. 


METHODS SUMMARY 


Cr, Ti and Al concentrations, representing more than 100 Precambrian iron for- 
mations and Phanerozoic ironstone/hydrothermal samples and consisting of 2,021 
separate data points, were amassed from previously published bulk analyses and 
from new analyses on both bulk and grain-by-grain (haematite and magnetite only) 
scales. Samples selected for new geochemical analyses include both drill core and 
hand samples from fresh exposures. Oxidized surfaces, veins and strongly recrys- 
tallized zones were avoided to minimize the effects of weathering and diagenetic or 
metamorphic overprinting. New analyses were performed at the University of 
Alberta using a Perkin Elmer Elan6000 quadrupole ICP-MS (inductively-coupled 
plasma-mass spectrometer) on powdered sample digests in solution mode and in 


372 | NATURE | VOL 478 | 20 OCTOBER 2011 


situ on epoxy mounts using a New Wave UP-213 laser ablation system. At the 
Woods Hole Oceanographic Institution, powdered samples were analysed in solu- 
tion mode using a ThermoElectron Element2 high-resolution ICP-MS. Instrument 
calibration was achieved using commercial standard solutions and verified against 
international glass standards NIST 610 and 612 at the University of Alberta and 
geostandards IF-G and BHVO-1 at the Woods Hole Oceanographic Institution. Cr 
isotope data presented by Frei et al. are replotted in Fig. 1c and reported in 6 
notation relative to NIST SRM 979. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Sample selection. In our previous work”, the interpretation of trace element 
signatures in iron formations as proxies of ancient sea water warranted a pre- 
selection of samples with minimal detrital influence based on chemical composi- 
tion. One method commonly used to differentiate detritally-sourced components 
versus authigenic precipitation is through measurement of incompatible elements, 
such as Al, Ti, Zr, Th, Hf and Sc**. However, both the low solubility of Cr(m) and 
the rapid reduction of Cr(iv) in the presence of Fe(11) makes it highly unlikely that 
Cr in iron formations is representative of bulk ocean water. Instead, the focus of 
this study was to ascertain Cr fluxes from land to the marginal marine setting, and 
in this regard, shallow water iron formations (as well as their Phanerozoic counter- 
parts, ironstones) are valuable proxies even if they include a significant detrital 
component. Thus, we present here all available iron formation and ironstone data. 
Only bulk-digest analyses of potential iron ores (possessing >50 wt% Fe, an upper 
boundary for iron formations”) or iron oxide-rich rocks that are believed to have 
formed through metasomatic processes (for example, iron oxide copper-gold ore 
deposits) have been excluded. Also, it is important to note that in our compilation 
of 2,021 data points sample preparation and analytical techniques varied; in order 
to avoid selective bias to our results, all available data were included. 

Cr analyses of iron formations. Data are presented in Supplementary Table 1. 
Iron formation and ironstone samples selected for new geochemical analyses 
include both drill core and hand samples from fresh exposures. Oxidized surfaces, 
veins and strongly recrystallized zones were avoided to minimize the effects of 
weathering and diagenetic or metamorphic overprinting. At the University of 
Alberta, samples for bulk major and trace element analysis were cut in slabs, 
broken into small chips (<5mm) without metal contact, and powdered in an 
automated agate mill. Crushed rock powders (<100 mesh) were dissolved with 
heated HF +HNO; and analysed using a Perkin Elmer Elan6000 Quad-ICP-MS 
(quadrupole inductively coupled plasma mass spectrometer). Accuracy and pre- 
cision of the analytical protocol were verified with the use of well-established 
international whole rock standards (for example, BE-N Basalt, CRPG Nancy; 
see http://research.eas.ualberta.ca/rif/quad_icp_ms.html). In situ trace element 
analyses were performed on polished epoxy mounts using the same Quad-ICP- 
MS connected to a laser ablation system operating with spot sizes of 20 to 60 um. 
Optimisation of ICP-MS parameters (RF power 1,200 W, peak hopping acquisi- 
tion, 50ms dwell time) were achieved by ablating either the NIST SRM 610 
(~400 p.p.m. Cr) or NIST SRM 612 (~40 p.p.m. Cr) international glass standards. 
For quantitative trace element determinations, the NIST 610 and 612 standards 
and iron formation/ironstone samples were ablated using identical conditions 
with spot sizes of either 20 um or 60 um, 5 Hz repetition rate and energy density 
of ~13Jcm ~. Quantitative results were obtained via the calibration of relative 
element sensitivities against the NIST 610 and 612 standards. Data reduction and 
calculations of concentrations were done using the GLITTER (XP version, New 
Wave Research) laser ablation software. Repeated analysis (n = 10) of the NIST 
612 using a 60 jm spot size yielded relative standard deviations of between 5 and 
15% (2c level) and a detection limit of 3 p.p.m. for Cr. 

At the Woods Hole Oceanographic Institution, clean samples of iron forma- 
tions were crushed between two Plexiglas discs inside a polypropylene bag using a 
hydraulic press. Rock chips were rinsed several times with deionized water during 
ultrasonication. The cleaned material was powdered in an agate shatterbox. Iron 
formation and ironstone sample powders were weighed in 5 ml Teflon beakers and 
dissolved using 2 ml of concentrated trace-metal grade HNO3 with 2 ml of con- 
centrated HF. After evaporation on a hot plate at 50 °C, complete dissolution and 


Fe oxidation were achieved by a second evaporation step using 4 ml of aqua regia. 
The dry residue was then dissolved in 4ml of 6 N HCl and 1 drop of HO, by 
heating at 40°C in a closed vessel. Organic carbon was not attacked by this 
procedure and was always present in trace amounts. Trace and major element 
analysis were performed on a ThermoElectron Inc. Element2 ICP-MS. Samples 
were spiked with 5 p.p.b. of internal In standard to correct for changes in instru- 
ment sensitivity during the analytical session. Unknown sample concentrations 
were calibrated against matrix-matched, multi-elemental standards prepared with 
Specpure plasma solution standards. Based on multiple analyses of randomly 
selected samples across multiple analytical sessions, accuracy for Cr was >95%. 
Analytical precision and accuracy were also checked by multiple analyses of the 
geostandards IF-G and BHVO-1. 

Cr isotope data presented by Frei et al.’ are replotted in Fig. 1c and reported in 6 

notation relative to NIST SRM 979. For Cr isotope analytical details and additional 
information regarding available data and samples, see Frei et al.’. 
River/marine mixing model. The mixing model was constructed using ther- 
modynamic constraints on Cr(1!) solubility and approximate calculations that 
make several simple assumptions as to the source and fate of Cr and Fe upon 
mixing of river and marine waters in a shallow, marginal marine setting. These 
assumptions include: (1) that all Cr is sourced from a soil leachate solution in 
which Cr concentration is governed by equilibrium with amorphous Cr(OH)3; (2) 
that in order to account for surface runoff and areas of pyrite-free catchment, the 
soil leachate solution constitutes some fraction (that is, 0.01 to 100% leachate 
contours in Fig. 3) of the total riverine input; (3) that all Fe is sourced from sea 
water containing 10 p.p.m. Fe; and (4) that any given iron formation Cr/Fe ratio 
represents a constraint that the model must respect through appropriate mixing of 
river and marine inputs and quantitative precipitation of both Cr and Fe. Data 
representing the lower end of mean Cr/Fe values reported for the Timeball Hill 
iron formation (197 p.p.m. Cr, 47 wt% Fe, molar Cr/Fe of 0.00046; ref. 29) was 
chosen as a conservative model target in terms of iron formation Cr/Fe ratio; data 
from this study indicate significantly higher values and even more acidic condi- 
tions during the deposition of the Timeball Hill iron formation. 

Cr(i1) solubility as a function of pH was calculated for a solution saturated with 
respect to amorphous Cr(OH)3, at 25 °C and in 0.01 M NaCl to mimic freshwater 
conditions, using the equilibrium modelling suite Visual Minteq version 2.61 
(ref. 37) with the default supplied database thermo_minteq.dat (Supplementary 
Fig. 2). For each pH examined in the mixing model, the question becomes what 
river/marine mixing ratio is required, for a given leachate/river mixture, to supply 
Cr and Fe in the correct proportion to satisfy the iron formation Cr/Fe constraint, 
based on 10 p.p.m. Fe in sea water and Cr(u1) in river water derived from pH- 
dependent solubility data. This approach can be considered conservative by the 
employment of the lower end of Cr/Ti enrichments for the Timeball Hill iron 
formation and by the fact that any Fe brought in by river water, any marine Fe 
concentrations above 10p.p.m., and any Cr lost during transport all serve to 
increase the required mixing ratio and thus indicate greater acidity. 
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The role of dyking and fault control in the rapid onset 
of eruption at Chaiten volcano, Chile 


Charles Wicks', Juan Carlos de la Llera?, Luis E. Lara? & Jacob Lowenstern! 


Rhyolite is the most viscous of liquid magmas, so it was surprising 
that on 2 May 2008 at Chaitén Volcano, located in Chile’s southern 
Andean volcanic zone, rhyolitic magma migrated from more than 
5 km depth in less than 4 hours (ref. 1) and erupted explosively with 
only two days of detected precursory seismic activity’. The last 
major rhyolite eruption before that at Chaitén was the largest 
volcanic eruption in the twentieth century, at Novarupta volcano, 
Alaska, in 1912. Because of the historically rare and explosive 
nature of rhyolite eruptions and because of the surprisingly short 
warning before the eruption of the Chaitén volcano, any informa- 
tion about the workings of the magmatic system at Chaitén, and 
rhyolitic systems in general, is important from both the scientific 
and hazard perspectives. Here we present surface deformation data 
related to the Chaitén eruption based on radar interferometry 
observations from the Japan Aerospace Exploration Agency 
(JAXA) DAICHI (ALOS) satellite. The data on this explosive 
rhyolite eruption indicate that the rapid ascent of rhyolite occurred 
through dyking and that melt segregation and magma storage were 
controlled by existing faults. 

Chaitén volcano is situated on a forearc sliver between the Chile- 
Peru subduction zone and the Liquife-Ofqui fault zone (LOFZ; 
Fig. 1). The LOFZ is a long-lived shear zone* that has accommodated 
transpressional (northward translational and eastward compressional) 
movement of the forearc sliver for at least the past ~4 Myr (refs 4, 5). 
Chaitén has a caldera roughly 3km in diameter that is part of the 
Michimahuida-Chaitén volcanic complex (Fig. 1). The dacitic to 
basaltic Michimahuida volcano, located in the LOFZ about 15 km east 
of Chaitén, has a larger caldera about 8 km in diameter that probably 
formed in the Early Holocene and was subsequently filled by dacite 
and later basalt effusions®. 

Although the study area is heavily vegetated and the time intervals 
between acquisitions are long, the 23.6-cm wavelength of the radar 
instrument enabled us to calculate useful interferograms (Fig. 1b-d 
and Supplementary Figs 1 and 2). In 2010 JAXA acquired descending- 
mode radar data, enabling the calculation of descending-mode inter- 
ferograms; the most coherent one is shown in Fig. 1d. Together the two 
ascending-mode interferograms in Fig. 1b,c span the time interval 
covered by the descending-mode interferogram shown in Fig. 1d. 
The descending-mode interferogram displays a broad area of move- 
ment (about 50cm maximum) away from the satellite that encom- 
passes the entire Michimahuida—Chaitén volcanic complex; it is not 
visible in the ascending-mode interferograms. The radar beams from 
both modes intersect the Earth’s surface at an angle of about 39° from 
the vertical. In the descending mode the satellite looks to the WNW, 
whereas in the ascending mode it looks to the ENE (red arrows in 
Fig. 1b-d). Consideration of the radar geometry, shown schematically 
in Fig. le, yields two important points: first, the deformation source 
responsible for the broad deformation signal is not an equidimensional 
volumetric source, and second, it should be a sill-like collapsing source 
that dips to the ENE with a dip angle that is nearly complementary to 
the radar incidence angle in the ascending geometry. 


Assuming an Earth model that is a uniform homogeneous elastic 
half-space, we determined that three separate deforming bodies were 
required to fit the data: first, a collapsing sill-like body (the ‘reservoir’) 
dipping to the ENE (that is, seeming to feed magma diagonally 
upwards from about 20km depth beneath Michimahuida); second, 
a near-vertical expanding dyke trending to the ENE and extending 
from the deeper reservoir to the surface at Chaitén volcano; and third, 
a collapsing steeply dipping lath-shaped body (the ‘conduit’) sur- 
facing near the west rim of the Chaitén caldera (Fig. 2). The dip of 
the reservoir is between 31° and 57° (Supplementary Fig. 3) to the 
ENE and the depth of the reservoir directly below Chaitén volcano is 
between 8 and 12km. Apparent overshoot of the reservoir up-dip 
from the conduit is supported by modelling (see Methods) and by 
the presence of a string of NNW-trending undated rhyolitic plugs 
(the most prominent being Morro Vilctin) located where the reservoir 
fault should reach the surface (Figs 2a, b and 3a). This indicates that 
rhyolite might have migrated the length of the reservoir fault during a 
past eruption episode. The maximum amount of collapse on the 
reservoir is not well determined but can be roughly bounded. In 
Fig. 2 we show two equally viable models with a variable distribution 
of collapse on the reservoir. The model shown in Fig. 2a, b is the result 
of constraining the maximum collapse to be 2 m, whereas the model 
shown in Fig. 2c, d and Supplementary Fig. 4 is the result of con- 
straining the maximum collapse to be 8m. The fit to the data is 
essentially identical for each model and for all intervening values of 
maximum collapse, with a variance for the residual of about 3 cm? for 
each model and with each model accounting for about 98% of the 
data variance (Supplementary Figs 1, 2 and 5). The two models in 
Fig. 2 represent loose bounds on a family of best-fit models with a 
maximum collapse between 2 and 8 m on the deep collapse body; the 
variance of the misfit increases for a maximum collapse of less than 
about 2 m or greater than about 8 m. As of about March 2010, the net 
decrease in volume represented by the three sources in the model is 
(150-250) X 10° m?. The estimated volume of material erupted dur- 
ing the May 2008 Plinian eruption’® plus the new dome’ is about 
(700-900) X 10° m?. It is tempting to attribute this difference to 
magma recharge; however, this observed relationship between change 
in reservoir volume and erupted volume is an expected consequence 
of magma compressibility’. 

Because of a lack of nearby seismic instrumentation, earthquake 
locations during the beginning phases of the eruption are not useful 
for additional constraints on the deformation sources. However, 
centroid moment tensor (CMT) solutions, recorded for the largest 
earthquakes, are consistent with our model. Only two earthquakes 
in the study area were large enough for CMT solutions to be calculated 
at the Global CMT Project (http://www.globalcmt.org) (Table 1 and 
Fig. 1d). CMT 1 (Fig. 1d and Table 1) is the solution for an earthquake 
of moment magnitude (M,) 5.2 that occurred about 2h before the 
main explosive event at 08:00 UT on 2 May 2008 (ref. 2). CMT 2 (Fig. 1d 
and Table 1) is the solution for a M,,5.0 earthquake that occurred 
about 19h after the onset of the main explosive event. Because each 
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Figure 1 | Tectonic setting for Chaitén Volcano and ALOS/PALSAR 
interferograms from two ascending image pairs and one descending pair. 
All figures in this paper were generated with GMT” open-source software. 

a, Tectonic plates (after ref. 28) are Nazca (NZ), South American (SA), Scotia 


(SC) and Antarctic (AN). NZ converges on SA at a rate of about 68 mm yr! 


(ref. 29) as shown by the white arrow. The approximate rupture area of the 
M,9.5 1960 earthquake” is labelled. The detailed study area in Fig. 2 is 
indicated by the red box. b, Interferogram from ascending path 121 orbits 
09599 (2007 November 13) and 21677 (2010 February 18) with a perpendicular 
baseline (B , ) of 130 m. The direction of the satellite trajectory is shown by the 
red arrow labelled ‘Az’ in each panel. The line-of-sight look direction from the 
satellite to the ground is shown with the red arrow labelled ‘LOS’. The black line 
labelled LOFZ marks the main strand of the LOFZ. Dashed black lines mark 
Michimahuida caldera (after ref. 6) and the smaller Chaitén caldera. 

c, Interferogram from ascending path 121 orbits 10941 (2008 February 13) and 
23019 (2010 May 21) with a B, of 404m. d, Interferogram from two 
descending path 417 orbits 09868 (2007 December 1) and 21946 (2010 
March 8) with a B, of 41m. The CMTs in Table 1 are plotted with the 
estimated two standard deviation error bars (http://www.globalcmt.org). 

e, Schematic explanation of the variable visibility of the deformation field 
between ascending and descending interferograms. Earth’s surface is the 
dashed black line, a ray from the descending radar beam is labelled ‘Desc-LOS’ 
and a ray from the ascending radar beam is labelled ‘Asc-LOS’. The collapsing 
sill-like body is red and the resulting deformation is depicted with red arrows. 


CMT solution includes a large compensated linear vector dipole com- 
ponent (CLVD), and because of the timing of the earthquakes in an 
eruption setting, we interpret the CMT solutions to be a magma injec- 
tion (dyking) event (CMT 1) and a reservoir collapse event (CMT 2). 
The CLVD contribution to the total moment of an earthquake is 


Table 1 | Centroid moment tensor solutions from Global CMT Project 
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Figure 2 | Two deformation models that can be considered as near 
endmembers in a family of ‘best-fit? models. Each model is composed of three 
planar bodies of distributed plane perpendicular opening or closing: a deep, 
dipping reservoir, a dyke and a narrow lath-like conduit (as discussed in the text). 
The outline of Michimahuida and Chaitén calderas and the approximate location 
of the LOFZ are shown (as discussed in Fig. 1). a, Plan view of the best-fit model in 
which the peak collapse in the deep body is constrained to 2 m. The surface trace of 
the dyke is red and the surface trace of the lath-shaped body is cyan. The plane 
containing the deep, dipping reservoir is projected to reach the surface at the 
dashed blue line. It dips about 40° to the ENE and is about 10 km deep below 
Chaitén. The red triangle near the dashed blue line shows the location of Morro 
Vilcun. The black line through the two calderas shows the location of cross-section 
A-A’ in Fig. 3. b, Perspective view of the model shown in a viewed from the NE. 
c, Plan view of the best-fit model in which the peak collapse in the deep body is 
constrained to 8 m (see also Supplementary Fig. 4). The red, cyan and dashed blue 
lines are as described in a. The reservoir dips about 45° to the ENE and is about 
12 km deep below Chaitén. d, Perspective view of the model shown in c. 


measured by the value ¢ in Table 1, where ¢=|2A;//,| and the 
eigenvalues of the moment tensor are A, Az and A3 (|A,| = |A2| = |As]). 
An é value of 1.0 is a CMT with only a CLVD, and a value of 0 is a 
conventional shear-failure double-couple mechanism (see, for 
example, ref. 10). It has been shown” that a m, = 5.5 magma-injection 
earthquake, under the assumption of no net change in volume (or a 
trace of zero, as is assumed for Global CMT Project CMT solutions), 
could be best represented by a CLVD. CMT 1 is consistent with a near 
vertical dyke that trends to the ENE, the same as our modelled dyke 
(Fig. 3). CMT 2 is consistent with a collapsing sill-like body that dips 
roughly 42° to the ENE, the same as our modelled deep sill-like res- 
ervoir (Fig. 3 and Supplementary Fig. 3). As noted previously’, seismic 
data can be used to determine an accurate earthquake mechanism, but 
accurate earthquake locations cannot be determined without instru- 
mentation close to the source. The location and depth of CMT solu- 
tions have been shown to be biased by tens of kilometres in remote 
locations’*"*. In Fig. 3 we plot the CMT mechanisms projected in the 
cross-section geometry with the realization that depths are probably 
biased. 

Further support for the presence of the lath-like conduit was pre- 
sented previously’*: a network of seismometers was deployed in the 
area of Chaitén volcano for 11 months from December 2004 to 


CMT (Figs 1d and 3) Origin time (UT) Latitude Longitude Depth (km) My Eigenvalues Ay, Ao, 43 & 
1 2008/05/02 05:51 42.93°S 72.65°W 17.4 5.2 1.10, —0.74, —0.36 0.65 
2008/05/03 02:36 42.82°S 72.56°W 12.0 5.0 —5:68, 3.58, 2.10 0.74 
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Figure 3 | Schematic interpretation of the involvement of deformation 
sources in the 2 May 2008 eruption of Chaitén. The location of cross-section 
A-A’ is shown in Fig. 2a; the positions of Morro Vilcun (MV), Chaitén (Ch) 
and Michimahuida (M) are labelled on the elevation profile. The dashed black 
lines show the approximate location of the LOFZ and inferred fault splays. The 
salmon-coloured area is the rhyolite reservoir that grades into a zoned magma 
chamber beneath Michimahuida. a, The blue box beneath Chaitén shows the 
approximate location of the cluster of seismicity located previously’*. The 
dipping rhyolite reservoir and the lath-like conduit are discussed in the text. 
The seismicity, beginning on 30 April 2008 (ref. 2) indicates the initiation of 


November 2005. This temporary network enabled the location of a 
cluster of seismicity directly beneath Chaitén volcano that spanned a 
depth interval from the top of our inferred dipping reservoir to about 
5.5 km depth (Fig. 3a). This seismicity coincides with the location of 
the narrow conduit in our model and might indicate ongoing inflation 
during the buildup to the eruption. One of the earthquakes roughly 
8.5 km beneath Chaitén was large enough (M,, 3.56) for a moment 
tensor solution to be determined"*. The CLVD component could not 
be resolved, but the geometry was roughly consistent with strike-slip 
movement on either the fault surface containing the narrow lath-like 
conduit or a possible fault surface containing the dyke. 

The nature of the inferred faults in Figs 2 and 3, west of the LOFZ, 
might be similar to those defined in recent studies*” that outlined pop- 
up structures at latitudes immediately south of the Michimahuida- 
Chaitén volcanic complex. The pop-up structures are defined by a 
structural high centred on the LOFZ that is bounded on the west by 
east-dipping reverse faults and on the east by west-dipping reverse 
faults that merge at depth with the LOFZ. Vertical shear structures 
are found between the reverse faults. Alternatively, the inferred faults 
could be part of a fault complex that occurs between two separate 
north-south branches of the LOFZ*: one branch is as shown in the 
figures and the other branch could extend northwards into the study 
area beneath the Golfo Corcovado. We interpret the dipping sill-like 
reservoir as containing rhyolitic magma that occupies an ENE-dipping 
reverse fault that is related to the LOFZ. If this is so, the rhyolite 
reservoir would tap a zoned magma body"® under Michimahuida that 
is the site of basalt/andesite influx. The reverse fault is an effective path 
for magma migration and the segregation of low-density melt. Fault 
involvement and tectonic control in the Chaitén eruption have been 
proposed previously’””. 

Petrological experiments’ on the pumice ejected during the initial 
Chaitén Plinian eruption indicate that the material came from a depth 
of about 5-9 km in less than about 4h. This depth range is consistent 
with a magma source from the deep, dipping, sill-like reservoir beneath 
Chaitén. The earthquake for which a dyking CMT mechanism was 
recorded (CMT 1: Figs 1d and 3b and Table 1), just over 2 h before the 
main Plinian eruption, indicates that the rapid transport of magma 
was accomplished through dyking. After the dyking event, rhyolite 
magma was rapidly evacuated from the reservoir until it partly col- 
lapsed, causing the earthquake represented by CMT 2 (Figs 1d and 3c 
and Table 1). The first erupted material was an ash fall noted about 
11h before the main Plinian eruption’; this pre-Plinian ash flow might 
have come from the lath-like conduit, before the dyke reached the 
surface (between the situations shown in Fig. 3a, b). 
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dyking (coloured red) into rock above an inflated dipping sill. b, Continued 
dyking leads to the injection earthquake (projection of CMT 1 (Table 1) in the 
azimuth of the dyke is shown) just over 2 h before the explosive eruption. The 
dyke, roughly in the plane of the page, impinges on a vertical strand of the LOFZ 
beneath Chaitén, the location of the narrow lath-like conduit. c, On 3 May, 
nearly 19h after the main Plinian eruption began, the dipping sill collapses 
catastrophically (projection of CMT 2 (Table 1) in the direction of the down- 
dip azimuth for the deep collapse body is shown) after a critical amount of 
magma has been withdrawn. The two standard deviation error bars for the 
CMT depths are shown at the left of b and c. 


Our model of the magmatic system beneath Chaitén favours a com- 
mon crustal magma chamber for Chaitén and Michimahuida, both of 
which erupt on a timescale of hundreds to thousands of years. Basalt 
intrusions into the magma chamber beneath Michimahuida must 
occur frequently to keep Michimahuida active, but how these intru- 
sions promote rhyolite exfiltration towards the Chaitén reservoir 
remains obscure. We speculate that crustal tectonics could have a 
two-way role in reshearing the LOFZ pop-up structure, assisted by 
magmatic overpressure, and draining the rhyolite from a more mafic 
mush. Alternatively, some event such as the 1960 earthquake could 
lead to the formation of the Chaitén magma reservoir by the mech- 
anical introduction of permeability and a pressure gradient around an 
overpressured magma body beneath Michimahuida. Large subduction 
earthquakes occur ona scale of hundreds of years in this part of Chile’*. 
In the latter situation, the rhyolitic magma body beneath Chaitén has 
been filling since the great 1960 Chilean earthquake (Fig. 1) until the 
resulting overpressure exceeded the overlying rock strength in May 
2008. 


METHODS SUMMARY 


To model the interferograms, we first resampled the InSAR data by using a 
quadtree algorithm, thus reducing the large number of highly correlated data 
points in each InSAR image’’”® (Supplementary Figs 1 and 2). We conducted 
preliminary exploratory modelling using one, two and three rectangular 
sources of uniform opening and closing’'. The addition of the third source 
markedly reduced the variance of the residual compared with one and two 
sources. We then further investigated a model consisting of three independent 
patches of distributed opening or closing. Each distribution is described func- 
tionally by a two-dimensional Weibull distribution that includes a covariance 
(or interaction) parameter*”’. We used a constrained Monte Carlo nonlinear 
inversion scheme to find the location, strike, dip, maximum opening or closing 
and Weibull parameters for each distribution of opening or closing, a trun- 
cation parameter for each distribution (allowing some or all of the distribution 
to be of uniform opening or closing), and static shift and tilt parameters for 
each interferogram. This method allowed us to search a wide range of model 
geometries and distributions automatically and it also enabled the automatic 
modelling of sharp spatial discontinuities in opening or closing. In the limiting 
cases the two-dimensional Weibull distribution” can be a point source, a line 
source or an area of uniform opening or closing, but it can also be a teardrop- 
shaped or tadpole-shaped*’ dyke (Supplementary Fig. 4). The significance of 
misfit and uncertainties in model parameters discussed in the text were found 
by means of an F-test and a 95% probability. It has been shown” that the 
quadtree parsed data can be treated as independent data points, allowing us to 
use this method of uncertainty estimation (however, see Methods and 
Supplementary Fig. 3). 
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METHODS 


Modelling. To model the interferograms, we first resampled the InSAR data by 
using a quadtree algorithm, thus reducing the large number of highly correlated 
data points in each InSAR image’””® (Supplementary Figs 1 and 2). We conducted 
preliminary exploratory modelling using one, two and three rectangular sources of 
uniform opening and closing”’. The addition of the third source markedly reduced 
the variance of the residual compared with one and two sources. We then further 
investigated a model consisting of three independent patches of distributed open- 
ing or closing. Each distribution is described functionally by a two-dimensional 
Weibull distribution that includes a covariance (or interaction) parameter’. We 
used a constrained Monte Carlo nonlinear inversion scheme to find the location, 
strike, dip, maximum opening or closing and Weibull parameters for each distri- 
bution of opening or closing, a truncation parameter for each distribution (allow- 
ing some or all of the distribution to be of uniform opening or closing), and static 
shift and tilt parameters for each interferogram. This method allowed us to search 
a wide range of model geometries and distributions automatically and it also 
enabled the automatic modelling of sharp spatial discontinuities in opening or 
closing. In the limiting cases the two-dimensional Weibull distribution” can be a 
point source, a line source or an area of uniform opening or closing, but it can also 
be a teardrop-shaped™ or tadpole-shaped”’ dyke (Supplementary Fig. 4). 
Because the CMT solutions support the presence of the dipping reservoir and the 
dyke, but not the conduit, we tried modelling the InSAR data with only two sources. 
Models consisting of only two deforming bodies yielded a significantly worse fit, 
assuming the quadtree data are independent”, with less than a 1 in 10” chance that 
the higher variance compared with the best-fit models could be attributed to random 
fluctuations in the data. The collapsing conduit is a necessary feature. Before 
descending data were available over Chaitén, it was found’ that the model that best 
fitted the ascending ALOS data (Fig. 1b, c), out of several models ranging from a point 
pressure source to models including slip, was a narrow steeply dipping uniform 
collapsing rectangular dyke, similar to the conduit in our model. We also modelled 
the data with a dyke and a listric fault (a curved surface) occupied by both the conduit 
and the reservoir, but the fit to the data was significantly worse than the best-fit 
models, with about a 1 in 10° chance that the higher variance compared with the best- 
fit models could be attributed to random fluctuations in the data. The conduit and 
reservoir are therefore not continuous on a curved surface, and the overshoot of the 
large ENE dipping reservoir up-dip from the conduit is a required feature. 
Uncertainties. The significance of misfit and uncertainties in model parameters 
discussed in the text were found by means of an F-test and a 95% probability. It has 
been shown” that the quadtree parsed data can be treated as independent data 
points, allowing us to use this method of uncertainty estimation (Supplementary 


Fig. 3); however, correlated noise might bias these estimates. To address concerns 
that correlated noise in the interferograms might affect our uncertainty estimates, 
we use a method of model parameter uncertainty estimation that involves estim- 
ating noise covariance for each interferogram (for background on the method see 
refs 26, 32, 33). We first found a covariance function for each interferogram using 
data outside the deforming area, then approximated the covariance function C(r) 
with an simple exponential decay function*? C(r) = oe *”, where r is the distance 
between data points, o” is the variance and » is the decay constant. We then 
constructed noise covariance matrices for each interferogram, from which we 
propagated noise covariance matrices for the quadtree data for each interferogram 
(Supplementary Figs 6a and 7a). Using the quadtree covariance matrices (follow- 
ing refs 32, 33) we calculated possible realizations of correlated noise for each 
quadtree point (Supplementary Figs 6b and 7b), added the noise to the data, and 
then found the resulting optimal model. We performed this procedure 100 times 
and plotted the distribution of dips for the dipping reservoir, which has been taken 
to represent the error (Supplementary Fig. 3). If the noise covariance estimates of 
uncertainty are accurate and if the F-test uncertainties are biased by uncorrelated 
noise, we would expect the F-test uncertainty intervals to be narrower than the 
noise covariance uncertainty intervals. However, comparison of the dip uncer- 
tainty intervals reveals that the range of uncertainties in dip using the noise 
covariance is much narrower than the range of F-test uncertainties. We suggest 
that the noise covariance method of uncertainty estimation either does not properly 
interrogate the data structure or does not thoroughly investigate the trade-off in 
model parameters (or both). Further investigation of these differences requires 
more study beyond the scope of this paper. In the worst case, the noise covariance 
uncertainties represent an additional +4° of uncertainty to our F-test uncertainties 
of about +13°; this would put the uncertainties close to the red line in 
Supplementary Fig. 3. From this comparison of uncertainties, we conclude that 
the role of uncorrelated noise on our F-test uncertainties is small (at the worst) and 
does not change our interpretation or conclusions. 
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Primary forests are irreplaceable for sustaining 


tropical biodiversity 
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Human-driven land-use changes increasingly threaten biodiversity, 
particularly in tropical forests where both species diversity and 
human pressures on natural environments are high’. The rapid 
conversion of tropical forests for agriculture, timber production 
and other uses has generated vast, human-dominated landscapes 
with potentially dire consequences for tropical biodiversity~>. 
Today, few truly undisturbed tropical forests exist, whereas those 
degraded by repeated logging and fires, as well as secondary and 
plantation forests, are rapidly expanding®’. Here we provide a global 
assessment of the impact of disturbance and land conversion on 
biodiversity in tropical forests using a meta-analysis of 138 studies. 
We analysed 2,220 pairwise comparisons of biodiversity values in 
primary forests (with little or no human disturbance) and disturbed 
forests. We found that biodiversity values were substantially lower 
in degraded forests, but that this varied considerably by geographic 
region, taxonomic group, ecological metric and disturbance type. 
Even after partly accounting for confounding colonization and 
succession effects due to the composition of surrounding habitats, 
isolation and time since disturbance, we find that most forms of 
forest degradation have an overwhelmingly detrimental effect on 
tropical biodiversity. Our results clearly indicate that when it comes 
to maintaining tropical biodiversity, there is no substitute for 
primary forests. 

As the extent of primary forests is shrinking throughout the tropics, a 
growing body of work has quantified the biodiversity values of degraded 
tropical forests. The ecological responses following forest conversion 
vary markedly across taxonomic groups, human impact types, ecological 
metrics and geographic regions**'°. Most studies, however, provide 
limited insight into the varied responses of tropical forest biota to human 
impacts because they are understandably restricted to particular distur- 
bance types'"”’, taxa’*"* and geographic regions’*. Therefore, their often 
contrasting conclusions might have clouded ongoing debates over the 
conservation value of modified forest ecosystems*. A comprehensive 
meta-analysis of the conservation value of human-modified tropical 
forests is therefore sorely lacking. Notably, such an assessment could 
provide a critical baseline for monitoring progress towards global con- 
servation targets'®, evaluate the biodiversity benefits of international 
carbon-trading initiatives to reduce emissions from deforestation and 
forest degradation’”"* (for example the United Nations REDD+ pro- 
gramme), and guide policy development through the integration of 
biodiversity data into the modelling of land-use change scenarios*’?”®. 

Here we conduct a global meta-analysis to measure the varied effects 
of land-use change and forest degradation on biodiversity in tropical 


forests. From an exhaustive literature search, we identified 138 studies 
that reported measures of biodiversity from multiple sites in both 
primary and disturbed tropical forests (Methods). We necessarily 
assumed that all ‘primary forests’ referred to in our source literature 
are largely old-growth forests that have experienced little to no recent 
human disturbance, although we recognize that in reality few primary 
forests are likely to be genuinely pristine. Primary forests are starkly 
differentiated from disturbed sites, which encompass the full spectrum 
of degraded and converted forest types, including selectively logged 
forests, secondary forests and forests converted into various forms of 
agriculture. In total, these studies spanned 28 countries and 92 study 
landscapes (Fig. 1). To measure the effect size of human-driven land- 
use changes, we calculated the weighted average of the standardized 
difference (based on pooled variance measures) between mean bio- 
diversity measurements in primary and disturbed sites’! (that is, 
Hedges’ g*). The effect size was positive when the biodiversity value 
of primary forest sites was greater than that of disturbed sites, implying 
that the measured disturbance had a detrimental impact on bio- 
diversity. We used a resampling procedure based on 10,000 bootstrap 
samples (with replacement) to generate the median effect size and 95% 
confidence intervals. 

Overall, human impacts reduced biodiversity in tropical forests, 
although the effect size varied by region, taxonomic group, metric 
and disturbance type (Fig. 2). The median effect size for all 2,220 
pairwise comparisons from 138 studies was 0.51 (95% confidence 
interval, 0.44-0.58) (Supplementary Table 1). This changed little when 
we accounted for pseudoreplication from studies that reported mul- 
tiple comparisons, using a resampling procedure in which one com- 
parison per study was randomly drawn for 10,000 samples, yielding an 
overall effect size of 0.57 (0.35-0.79) (Supplementary Table 1). Our 
results are also robust to publication biases (Methods). The surround- 
ing habitat might either ameliorate (if hospitable) or exacerbate (if 
hostile) the impact of forest disturbance on biodiversity”. Although 
data are lacking for a comprehensive analysis, to account partly for this 
effect we repeated our analysis using only those studies that had nat- 
ural vegetation (that is, primary and selectively logged forests) as the 
surrounding habitat (70.1% of all pairwise comparisons). Using this 
subset, we detected no substantial change in either the direction or the 
magnitude of effect sizes for the full data set (0.58, 0.49-0.68), or for 
each of the variables described below (Supplementary Table 1). 

We found that human impacts on biodiversity varied by region. 
Although our data set is highly comprehensive, it is still limited given 
the vast extent of tropical forests and the myriad ways in which 
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Figure 1 | Map of study sites by country and by study location. Country colour represents the number of studies per country (1 = 28 total countries) and circle 
size represents the number of studies at each site (n = 92 total sites; only 82 sites with Global Positioning System coordinates are shown). 


humans disturb them”. Asia (52 studies) and South America (47) were 
the subjects of considerably more studies than were Central America 
(27) and Africa (12) (Fig. 1 and Supplementary Table 1). This regional 
bias implies that our findings might be more generalizable to Asia and 
South America than to other tropical regions. More critically, it high- 
lights an urgent need for more research, particularly in Africa, which 
sustains the second largest contiguous tropical forest in the world’. 
Despite this important caveat, we found that Asia harbours the most 
sensitive biota, producing an effect size of 0.95 (0.83-1.08), which is 
substantially higher than that of the other three regions (Fig. 2a). This 
highlights the great toll human land-use changes are exacting in Asia, 
particularly in Southeast Asia, which most Asian studies (44 of 52) 
considered. Recent and widespread expansion of oil palm mono- 
culture and exotic-tree plantations has greatly modified forest habitats 
in this region”, but all forms of human impact were higher in Asia than 
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elsewhere (Fig. 3a), suggesting that this regional pattern holds regard- 
less of disturbance type. Our results highlight the critical need to 
mitigate the particularly detrimental human impacts in Asia”. 

Most taxonomic groups we assessed were negatively affected by 
disturbance, with effect sizes greater than 0.5 (Fig. 2b and Sup- 
plementary Fig. 1b). However, mammals were less sensitive to the 
disturbances measured and, in some instances, actually benefitted 
from human disturbance, with an effect size of —0.12 (—0.24 to 
—0.01). This disparity, largely due to higher mammal abundances in 
certain disturbance types (Fig. 3b and Supplementary Table 3), might 
arise because of mammals’ high tolerance of degraded forests and 
forest edges”®, particularly among small mammals (—0.04, —0.27 to 
0.20) and bats (— 0.24, —0.42 to —0.06), which dominated most studies 
on mammals (Supplementary Table 1). At the other extreme, birds 
were the most sensitive group, with an effect size of 0.72 (0.52-0.93). 
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Bootstrapped effect size 


Figure 2 | Box plots of bootstrapped effect size. a, By region; b, by taxon; c, by 
response metric; d, by disturbance type (omitting clear-cut and disturbed/ 
hunted owing to small sample sizes, that is, <50 comparisons). Plotted are 
median values and interquartile ranges of 10,000 resampled (with replacement) 
effect size calculations for each group. Widths of notches in box plots 


approximate 95% confidence intervals. Median value for forest species richness 
(FSR) is plotted for comparison. The vertical black and grey dashed lines 
represent an effect size of zero and the median effect size for the entire data set, 
respectively. Sample size is shown in parentheses. 
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Figure 3 | Box plots of bootstrapped effect size. a, By disturbance type; b, by 
response metric, as in Fig. 2. Median effect size is also plotted as a function of 
region and taxon, with overlapping points stacked: Af, Africa; As, Asia; CA, 
Central America; SA, South America; a, arthropods; b, birds; m, mammals; p, 
plants. Vertical lines are as in Fig. 2. 


These results varied by disturbance type; birds constituted the group 
most sensitive to forest conversion into agriculture (active agriculture, 
abandoned agriculture and agroforestry systems), whereas plants 
constituted the group most sensitive to burned forests and shaded 
plantations (Fig. 3a and Supplementary Table 2). The effect size for 
arthropods (0.64, 0.52-0.78) when further differentiated into the 
three main taxonomic orders revealed some differences: Coleoptera 
was more sensitive to disturbance (1.01, 0.75-1.30) than were 
Hymenoptera (0.41, 0.11-0.69) and Lepidoptera (0.58, 0.28-0.89) 
(Supplementary Table 1). In general, our findings reflect a paucity of 
information about most of the world’s tropical biota; more data are 
needed to understand the ecological mechanisms underlying the dif- 
fering vulnerability of taxa to human disturbance”’. 

The source literature we considered used various measures of bio- 
diversity, which we broadly differentiated into five response metrics: 
abundance, community structure and function, demographics, forest 
structure, and richness (Methods, Fig. 2c and Supplementary Fig. 1a). 
Of these, abundance and richness were the most commonly reported 
metrics, together comprising over three-quarters of all pairwise 
comparisons. Richness (0.83, 0.72—0.95) was markedly more sensitive 
to human disturbance than abundance (0.19, 0.07—0.31) (Figs 2c and 
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3b and Supplementary Tables 2 and 3). This result accords with expec- 
tations, given observations of large increases in the abundance of 
generalist species following similarly large declines in richness in 
degraded tropical forests*’*. Furthermore, our measure of richness 
was predictably conservative because it assessed both forest specialists 
and generalists; when restricted to forest specialists (n = 70 compar- 
isons), the effect size for species richness increased to 1.16 (0.69-1.65) 
(Fig. 2c and Supplementary Table 1). Measures of forest species rich- 
ness therefore could serve as a simple yet effective metric to assess the 
conservation value of tropical forests and the relative impacts of dif- 
ferent patterns of human modification, particularly during the early 
stages of forest conversion when conservation actions are most 
urgently needed. 

We identified 12 general forest disturbance or conversion classes, 
and all but one of those with adequate sample sizes had effect sizes 
greater than 0.4 (Supplementary Table 1). In general, agricultural land- 
use classes (abandoned and active agricultural sites) had a much 
greater impact than agroforestry systems and plantations (both shaded 
and unshaded) (Fig. 2d). As the single exception, selectively logged 
forests (largely those affected by a single cutting cycle) had a much 
smaller, yet still positive, effect size of 0.11 (0.01-0.20). This is con- 
sistent with previous studies showing that selectively logged forests 
retain a high richness of forest taxa’*. Although these findings suggest 
that logged forests could contribute to biodiversity conservation, there 
are several caveats that need consideration: (i) if logged forest sites are 
adjacent to primary forests, spill-over effects might exaggerate the 
species richness of logged forests” (acting as sink habitats); (ii) the 
proximity of logged forests to primary forests might also result in 
species extinction debts that are repaid over lengthy periods of time, 
beyond the timescale of the short-term studies that comprise most of 
our data set (83.6% had a time since disturbance of =12 yr); (iii) 
repeated logging might further exacerbate these biodiversity impacts; 
and (iv) the networks of forest roads created by logging operations 
might facilitate human immigration to forest frontiers and trigger 
associated increases in fires and forest conversion”. As selective 
logging continues to expand across the tropics*’, understanding its 
long-term impacts and interactions with other forms of disturbance 
such as fire and invasive species” will become increasingly important 
for the conservation of tropical biodiversity. 

In contrast with the relatively benign selectively logged forests, 
secondary forests of varying ages had an intermediate effect size of 
0.41 (0.28-0.54). It has been suggested recently that secondary forests 
can bean effective complement to primary forests in supporting tropical 
biodiversity, and should therefore represent a priority for con- 
servation’’. Although the wide variety of secondary forests measured 
vary markedly in biodiversity value depending on forest age and land- 
use history, our meta-analysis demonstrates that secondary forests 
invariably have much lower biodiversity values than do remnant areas 
of relatively undisturbed primary forest (Supplementary Table 2). 
Although regenerating degraded areas can greatly increase the long- 
term persistence of biodiversity in severely modified landscapes’, our 
findings suggest that protecting remaining primary forests and restor- 
ing selectively logged forests are likely to offer the greatest conser- 
vation benefits for tropical biota. 

We tested the relative importance of the above-mentioned eco- 
logical correlates in explaining the effect size. We used an informa- 
tion-theoretic approach to evaluate the performance of a candidate set 
of generalized linear models (Methods). After controlling for pseudo- 
replication from studies, the most parsimonious model in predicting 
the impact of anthropogenic forest disturbance on effect size was the 
null model (selected in 37.3% of 10,000 iterations), with the models 
‘Region’ (23.1%) and ‘Response metric’ (14.4%) ranked second and 
third, respectively (Supplementary Table 5). This result also holds for 
a data set that includes only studies with natural vegetation as the 
surrounding habitat (m = 1,557), as well as for a smaller subset of 
data with information on time since disturbance and mean isolation 
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distance (n = 630; accounting for variation in colonization and suc- 
cession effects) (Supplementary Fig. 2 and Supplementary Table 5). 
Our analysis of generalized linear models showed that the observed 
detrimental disturbance effects are essentially universal and that cor- 
relates such as region, taxonomic group, disturbance type and eco- 
logical measure have little impact on the effect size. 

Our meta-analysis provides a global assessment of the relative con- 
servation value of a broad range of human-modified tropical forests. 
Our results demonstrate that forest conversion and degradation con- 
sistently and greatly reduce biodiversity in tropical forest landscapes. 
As an exception, selective logging of forests has a much lower 
detrimental effect on measured biodiversity responses, implying that 
ecological restoration of such areas could help to alleviate threats to 
tropical biodiversity. Overall, however, we conclude that primary 
forests are irreplaceable for sustaining tropical biodiversity. Con- 
sequently, we strongly urge their protection by enhancing of enforce- 
ment in existing protected areas, expanding the current network of 
reserves and curbing international demand for forest commodities 
obtained at the expense of primary forests. Improving mechanisms 
for delivering and sustaining the social, financial and technical support 
necessary to achieve such goals continues to present one of the greatest 
challenges to tropical biodiversity conservation in the twenty-first cen- 


tury. 
METHODS SUMMARY 


Using Web of Science and BIOSIS, we searched for all relevant research articles 
published between 1975 and October 2010 that (i) included measures of bio- 
diversity at multiple sites in both primary and disturbed tropical forests, (ii) 
indicated that the primary forests had little or no human disturbance and (iii) 
reported variance measures for biodiversity responses. From these studies, we 
compiled the biodiversity measures reported in both primary and disturbed forest 
sites and classified these measures using four variables: geographic region, 
taxonomic group, ecological response metric and disturbance type. For each 
paired biodiversity measure, we calculated the bias-corrected Hedges’ g*, the 
difference between primary and disturbed group means standardized by the 
pooled standard deviation. We then calculated the average effect size using 
the random-effects model, where effect sizes of individual comparisons are 
weighted by the inverse of within-study variance plus between-study variance”’. 
We repeated this procedure after resampling the effect size calculations using 
10,000 bootstrap samples (with replacement), from which we generated 95% 
confidence intervals. We calculated the effect size for the entire data set, for each 
subgroup of the four variables (region, taxon, response metric and disturbance 
type) and for each of the six two-level combinations of the four variables (for 
example disturbance type X region). We repeated the above calculations for a 
subset of the data set with natural surrounding habitat, to account for the influence 
of this habitat. We also tested the effect sizes for possible publication bias. 
Following ref. 15, we performed an information-theoretic evaluation ofa candidate 
set of generalized linear models to examine the influence of a set of proposed 
factors on the ecological responses tabulated. The generalized linear models related 
the Hedges’ g* effect size to the categorical predictor variables region, taxonomic 
group, metric and disturbance type in the 15 possible variable combinations. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Data. We searched for all relevant research articles published between 1975 and 
October 2010 using Web of Science and BIOSIS with the search query (TS = 
[(bird* OR mammal* OR reptile* OR amphibia* OR arthropod* OR plants* OR 
lepidoptera* OR hymenoptera* OR arachnid* OR coleoptera* OR diptera* OR 
homoptera* OR isoptera*) AND (clear-cutting* OR log* OR deforestation* OR fire* 
OR agriculture conversion* OR disturbance* OR degradation* OR secondary forest* 
OR plantation* OR fragment*)]). From this list, we reviewed articles and retained 
those studies that (i) included measures of biodiversity at multiple sites in both primary 
and disturbed tropical forests, (ii) indicated that the primary forests had little or no 
human disturbance and (iii) reported variance measures for biodiversity responses. 
We defined primary forests as primary or old-growth forests that have never been 
clear-felled and have been impacted by little or no known recent human disturbance. 

For each study, we recorded the biodiversity measures in both primary and dis- 
turbed forest sites. For those studies that reported results in figures only, we extracted 
results using DATATHIEF (http://www.datathief.org). The full data set is available in 
the online version of the paper. For each comparison, we recorded the region (Africa, 
Asia, Central America (including Mexico), South America) and broad taxonomic 
group (arthropods, birds, mammals, plants). Although arthropods span diverse 
groups with potentially differing responses to human impacts’, our sample included 
predominantly insects (Coleoptera, 29.2%; Hymenoptera, 22.9%; Lepidoptera, 
22.6%) and we therefore treated it as a single group but reported differences between 
the three major insect orders represented. Mammals also comprised different groups, 
and we differentiated between bats (51.0%), large mammals (2.6%), primates (3.7%), 
small mammals (28.2%) and a miscellaneous group (14.4%). 

We classified the biodiversity measure into five response metrics: abundance 
(for example density, capture frequency, occupancy estimates and biomass); 
community structure and function (for example abundance of different guilds 
(generalists, herb specialists and so on), proportion of trait states and individual 
weight); demographics (for example density of different age classes (adults/ 
juveniles/saplings/seedlings), fruit/flower production and genetic measures); 
forest structure (for example canopy height/cover/openness, basal area, litter 
depth, diameter at breast height and other physical structural measurements, 
and density of trees of a given diameter at breast height); and richness (for example 
observed/estimated/rarefied richness, species density and genera/family richness). 
We omitted diversity indices (n = 151; for example Fisher’s alpha, Shannon- 
Wiener, Simpson’s and Margalef’s) because they were usually secondary (derived) 
measures of abundance and/or richness and are not straightforward to interpret. 

We recorded the disturbance type as specified by the authors of the source 
literature, which formed twelve distinct groups: abandoned agriculture, active 
agriculture, agroforestry, burned forests, clear-cut forests, disturbed/hunted 
forests, other extracted forests, pastures, plantations, secondary forests, selectively 
logged forests and shaded plantations. To avoid an inadequate treatment of forest 
fragmentation, which is an important topic, we necessarily excluded data on forest 
fragments. However, we recognize that remnant forest fragments, particularly 
large ones, in heavily human-modified ecosystems might be critical for biodiver- 
sity persistence. 

In addition, and where available, we collected data on patch size, surrounding 
habitat type, isolation distance and time since disturbance’*”. We categorized the 
predominant surrounding habitat of disturbed forests into five broad groups: natural 
vegetation (that is, primary and selectively logged forests), agriculture, disturbed 
forests, pastures and tree plantations. Using maps and/or geo-referenced locations 
from the source literature, we calculated isolation distance as the mean distance 
between disturbed sites and the nearest primary forest site to account for coloniza- 
tion effects for a smaller set of the data. We measured time since disturbance as the 
amount of time that had elapsed between the most recent form of disturbance and 
the time of study, as indicated by the authors of the source literature, to account for 
post-disturbance and time-lag effects. We excluded patch size or area information 
from our analysis largely as a result of ambiguity and extremely low sample size 
(22.6% of the comparisons provided this information for disturbed sites). We have 
already acknowledged the potential confounding effects of area in detail elsewhere’. 
Meta-analysis. For each comparison, we calculated Hedges’ g, the difference 
between primary and disturbed group means standardized using the pooled stand- 
ard deviation of the two groups”, defined as: 


_ Xprimary — “disturbed 
SDpooled 


where 


7) 3 
= (Mprimary — 1) SD rimary + (Maisturbed — 1) SDGisturbed 
pooled 
Nprimary + Ndisturbed — 2 


Because Hedges’ g is a biased estimator of population effect size, we used the 
conversion factor J to compute a bias-corrected metric, g* (ref. 21), defined as 
g* = Jg, where 
3 
A(Mprimary + Ndisturbed — 2) — 1 


J 


We then calculated the average effect size using the random-effects model, where 
effect sizes of individual comparisons are weighted by the inverse of within-study 
variance plus between-study variance’. For individual comparisons, we defined 
the effect size as positive for comparisons where the biodiversity value was higher 
in primary forest (such that a positive effect size indicates a more detrimental 
impact by the disturbance type). For a small subset of comparisons where the 
expected value would be lower in primary forest (1 = 180, 8.1% of all pairwise 
comparisons; for example measures of saplings/seedlings/juveniles, early/mid- 
successional species, non-forest/open-forest species, common/generalist/visitor 
species, trees of diameter at breast height <10 cm, dead/new trees and mortality/ 
recruitment rates), we defined the effect size as negative for comparisons where 
the biodiversity value was higher in primary forest. As our results might be 
affected by the selection of comparisons with an opposite expectation of the 
direction of the effect, we repeated the procedure after omitting those compar- 
isons. This led to an effect size of 0.45 (0.38-0.52), within the error of the effect 
size for the full data set, suggesting that our expectation did not affect the results 
(Supplementary Table 1). 

We calculated the effect size for the entire data set, for each subgroup of the four 
variables (region, taxon, response metric and disturbance type) and for each of the 
six two-level combinations of the four variables (for example disturbance type X 
region) (Fig. 3, Supplementary Fig. 1 and Supplementary Tables 2-4). For all 
combinations, we repeated this procedure after resampling the random-model 
effect size calculations using 10,000 bootstrap samples (with replacement), from 
which we generated 95% confidence intervals*'. To address potential spatial and 
temporal autocorrelation from studies that included several comparisons (for 
example multiple measurements of the same taxa, measurements of multiple taxa 
and measurements of multiple disturbance types), we repeated this procedure 
after resampling one comparison per study, again using 10,000 bootstrap samples 
(Supplementary Table 1). However, some autocorrelation (largely only spatial) 
remains because several studies were situated in the same site (Fig. 1), although 
it is probably not as pronounced as above. To account for the potential influ- 
ence of the surrounding habitat, we repeated the above calculations for a subset 
of the data set with natural surrounding habitat (70.1% of data) (Supplementary 
Table 1). 

We tested for publication bias using two methods to assess whether our calcu- 
lated effect sizes were affected by the possible absence of studies not published 
owing to a failure to detect differences”. First, we visually examined a funnel plot 
of effect size plotted against standard error to assess the symmetry of study pre- 
cision around effect size (Supplementary Fig. 3). The relatively symmetrical funnel 
plot suggests there is no relationship between effect size and study size, and that 
those studies with small (or negative) effect sizes do not have a lower probability of 
being published. Second, we sorted the data set by precision, from comparisons 
with small standard errors to those with large standard errors, and examined the 
change in cumulative effect size with the addition of the most imprecise studies 
(Supplementary Fig. 4). Although the addition of the most imprecise third of 
comparisons (those with the largest standard errors) does cause the cumulative 
effect size to increase, the effect size remains positive and does not overlap with 
zero at any point after the first 163 comparisons. We conclude that the impact of 
publication bias in our study is slight’. 

Generalized linear models. Following ref. 15, we performed an information- 
theoretic evaluation of a candidate set of generalized linear models (GLMs) to 
examine the influence of a set of hypothesized factors on the ecological responses 
tabulated. The GLM related the Hedges’ g* effect size to the categorical predictor 
variables region, taxonomic group, metric and disturbance type in the 15 possible 
variable combinations (Supplementary Table 5). We also evaluated the null 
(intercept-only) model, in which only a mean effect size is estimated (that is, 
no correlates). As with the meta-analysis, we accounted for pseudoreplication by 
selecting a random subset of the full data set, such that only one observation from 
each study was fitted using GLMs, and repeating the fitting procedure a total of 
10,000 times. Model comparisons and subsequent inference (using relative 
weights of evidence) were based on the small-sample-size-corrected Akaike’s 
information criterion (AIC, ref. 32), whereby a measure of Kullback-Leibler 
information loss (a fundamental conceptual measure of the relative distance of 
a given model from full reality, assumed to be represented in the model set) is 
derived and used as an objective basis for ranking the bias-corrected likelihood of 
models in an a-priori candidate set (thereby yielding an implicit estimate of 
model parsimony). The highest-ranked models according to AIC; are those that 
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explain the most substantial proportion of variance in the data yet exclude 
unnecessary parameters that cannot be justified for inference on the basis of 
the data*’. For the randomized GLM fits, we calculated the proportion of times 
each model was selected as the top-ranked model (7;), on the basis of AIC.. We 
used the per cent deviance explained to represent the structural goodness of fit of 
each model, with the 95% confidence interval of the per cent deviance explained 
estimated as the 2.5 and 97.5 percentiles of the 10,000 sample fits. We repeated 
the above analysis using only data with natural surrounding habitat, and using 
isolation distance and time since disturbance as additional predictor variables, 
thus increasing the possible variable combinations to 64 (including the null 
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model) (Supplementary Table 5). All statistical analyses and figures were made 
using the program R, version 2.11.1 (ref. 34). 
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Corridors of migrating neurons in the human brain 
and their decline during infancy 


Nader Sanai!*?, Thuhien Nguyen’, Rebecca A. Ihrie', Zaman Mirzadeh!?, Hui-Hsin. Tsai!, Michael Wong’, Nalin Gupta’, 
Mitchel S. Berger’, Eric Huang*, Jose-Manuel Garcia-Verdugo”, David H. Rowitch'*® & Arturo Alvarez-Buylla’? 


The subventricular zone of many adult non-human mammals 
generates large numbers of new neurons destined for the olfactory 
bulb’ *. Along the walls of the lateral ventricles, immature neuronal 
progeny migrate in tangentially oriented chains that coalesce into a 
rostral migratory stream (RMS) connecting the subventricular zone 
to the olfactory bulb. The adult human subventricular zone, in 
contrast, contains a hypocellular gap layer separating the ependy- 
mal lining from a periventricular ribbon of astrocytes’. Some of 
these subventricular zone astrocytes can function as neural stem 
cells in vitro, but their function in vivo remains controversial. An 
initial report found few subventricular zone proliferating cells and 
rare migrating immature neurons in the RMS of adult humans’. In 
contrast, a subsequent study indicated robust proliferation and 
migration in the human subventricular zone and RMS*”. Here we 
find that the infant human subventricular zone and RMS contain an 
extensive corridor of migrating immature neurons before 18 
months of age but, contrary to previous reports’, this germinal 
activity subsides in older children and is nearly extinct by adult- 
hood. Surprisingly, during this limited window of neurogenesis, not 
all new neurons in the human subventricular zone are destined for 
the olfactory bulb—we describe a major migratory pathway that 
targets the prefrontal cortex in humans. Together, these findings 
reveal robust streams of tangentially migrating immature neurons 
in human early postnatal subventricular zone and cortex. These 
pathways represent potential targets of neurological injuries affect- 
ing neonates. 

We collected human brain specimens from 10 neurosurgical re- 
sections and 50 autopsied brains, ranging in age from birth to 84 years 
(Supplementary Fig. 1 and Supplementary Table 1), using a protocol 
that allows subsequent analysis by fluorescent immunohistochemistry 
and in situ hybridization (see Methods). As shown (Fig. 1), staining of 
horizontal sections (30 jm) through the anterior horn of the lateral 
ventricle demonstrates that, in the first 6 months of life, the structure of 
the human subventricular zone in infants differs considerably from 
that observed in adults. The astrocytic ribbon and gap layer are not 
evident (Fig. 1) and, as seen in the fetal human brain’*”, cells with 
elongated radial glial processes line the lateral ventricular wall and 
express vimentin and glial fibrillary acidic protein (GFAP)". Adjacent 
to these radial glia, we observed a dense network of elongated unipolar 
and bipolar cells oriented tangentially to the ventricular lining. Many of 
these cells expressed the immature neuronal markers doublecortin 
(DCX) (Fig. 1) and B-III tubulin (TUBB3). Some putative immature 
neurons also expressed polysialylated neural cell adhesion molecule 
(PSA-NCAM), which is present in migratory cells. They also had ultra- 
structural features of immature migrating neurons (Supplementary 
Fig. 2) similar to neuroblasts described in the rodent subventricular 
zone’. Progressively, between 6 to 18 months of age, the subventricular 
zone is depleted of this dense network of putative migratory neurons 


and adopts the characteristic adult structure with an astrocyte ribbon 
and hypocellular gap layer. The emergence of the gap layer coincides 
with the decline in DCX* immature neurons (25-fold during the first 6 
months; n = 16, ages 0-17 years; Fig. 1) and proliferation, suggesting 
that human subventricular zone neurogenesis decreases drastically 
during the first 6 months of life. Only a small number of proliferating 
cells were present in adolescents and adults. Expression of the pro- 
liferation marker Ki67 was not associated with pyknotic nuclei, 
although we cannot exclude that some Ki67* nuclei correspond to 
apoptotic cells induced by ischaemia’*. We also identified a subpopula- 
tion of epidermal growth factor receptor (EGFR)-positive cells, a marker 
associated with early progenitors including neural stem cells'* and 
transit-amplifying cells in mice’*, which similarly diminished with time 
(Supplementary Fig. 3). A subset of EGFR* cells expressed Ki67, while 
others co-localized with DCX or PSA-NCAM, potentially representing 
transitional stages from transit-amplifying cells to immature neurons. 
Whole-mount en face preparations of the subventricular zone also con- 
firmed massive numbers of tangentially oriented DCX* chains within 
the gap layer at 1 week and 2 months of life. In stark contrast, cells with 
the morphology and marker expression of immature migratory neurons 
were extremely rare in adults (Supplementary Fig. 4). 

Our results indicate that robust streams of tangentially migrating 
immature neurons initially populate the postnatal human gap layer; 
these pathways become depleted between 6-18 months and this region 
transitions into a hypocellular gap. Previous work has also demon- 
strated a similarly sharp decline in EGFR and PSA-NCAM immuno- 
reactivity in the human subventricular zone during the first year of 
life’. Within the subventricular zone, the decline of migratory imma- 
ture neurons appears first along the posterior third of the lateral vent- 
ricle and then progresses in a posterior-to-anterior trajectory towards 
the ventral tip (data not shown). After 18 months, Ki67~ proliferative 
activity and the number of DCX* immature neurons assume trace 
levels seen in adults, leaving behind the gap layer characteristic of the 
adult human subventricular zone’. 

We next investigated whether proliferation and the presence of puta- 
tively migrating, immature neurons in the subventricular zone were 
associated with an active RMS. Using autopsied material (n = 6) (ages 
1 day; 1 week; 1, 3 and 6 months), serial sagittal and coronal recon- 
struction of the ventral forebrain revealed an uninterrupted column of 
cells that connected the ventral tip of the subventricular zone to the 
olfactory peduncle (Fig. 2). The descending/proximal limb of the pae- 
diatric RMS contained cells organized as chains—large collections of 
elongated immature neurons expressing DCX and surrounded by glial 
cells and processes'*'*"? (Fig. 2)—or as broad streams of individual cells. 
A subpopulation of these DCX” cells expressed PSA-NCAM. Con- 
versely, analysis of tissue from older children (n = 7; 2, 3,7, 16, 17 years) 
failed to reveal chains of migrating cells or evidence of an active 
RMS. However, individual or pairs of elongated DCX PSA-NCAM* 
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Figure 1 | Cytoarchitectural development of the human subventricular zone 
during the first 18 months of life. a—q, Illustrations localizing the horizontal 
30-uum sections of the anterior ventral subventricular zone (a, b) labelled with 
nuclear marker DAPI (c, fi, l, 0), the immature migrating neuronal marker DCX 
(d, g, j, m, p) and the immature glial marker vimentin (e, h, k, n, q). F, frontal 
lobe; LV, lateral ventricle; OB, olfactory bulb; T, temporal lobe. Large numbers 
of DCX” immature migrating neurons populate the region that develops into 
the human gap layer (yellow dotted lines). e, h, k, n, q, In the first few months of 
life, radial glia-like vimentin” cells are seen to populate the ventricular lining 
(e, h), and at 12 months onward, a dense network of vimentin* processes fill 
the gap area (n, q). DCX and DAPI are co-labelled in the same section (first and 
second columns), whereas vimentin staining is from adjacent sections (third 
column). r, DCX* immature neurons within the anterior ventral 
subventricular zone were quantified in ten X20 fields per section, ten sections 
per specimen, and 1-3 specimens per time point. Each data point represents 
the mean = s.d. ofa single specimen at the designated age (m, month; w, week; 
y, year). Inset demonstrates the field of quantification (yellow box) for each of 
the ten horizontal sections per specimen. s, Proliferating Ki67* subependymal 
cells were similarly quantified along the anterior ventral human subventricular 
zone. Taken together, these data demonstrate, during the first 18 months oflife, 
a sharp decline in proliferating cells and immature migrating neurons 
coinciding with emergence of the human subventricular zone gap layer. Scale 
bars, 30 um. 
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Figure 2 | The infant human RMS connects the subventricular zone to the 
olfactory peduncle. a, b, Illustrations indicating the anatomical site, plane and 
orientation of tissue sectioning. F, frontal lobe; LV, lateral ventricle; 

OB, olfactory bulb; T, temporal lobe. c, Reconstruction of serial sagittal sections 
of a haematoxylin-stained RMS from inferior lateral ventricle (V) to olfactory 
peduncle (OP) of an infant specimen (5 months of age). The subventricular 
source of the proximal RMS shows multiple tributaries of cells (yellow arrows) 
that appear to coalesce forming the descending (proximal) limb of the RMS. 
There is no evidence of a continuous ventricular system, but isolated 
ependymal islets (yellow arrowheads) are observed along the RMS. The distal 
limb of the RMS moves laterally, out of the sagittal plane of sectioning, as 
indicated by the yellow section frames. Yellow box (C) indicates region of 
proximal limb of RMS shown at high resolution in d. d, Clusters of DCX* PSA- 
NCAM* immature neurons at higher magnification. e, Individual DCX* 
neuronal chains, seen here in the ventral tip of the anterior horn of a 1-week 
specimen, are typically embedded in a matrix of GFAP" glial processes and 
often migrate along blood vessels (see Supplementary Fig. 4d). f, Ultrastructural 
analysis of a neuronal chain cross-section identifies immature neurons (type A 
cells, labelled “A’) surrounded by glia (type B cells, labelled ‘B’). Scale 

bars, 500 mm (c), 50 tum (d), 101m (e) and 2 um (f). 


putative migratory neurons were occasionally noted in late childhood 
specimens (n = 2; 3 and 7 years) and adult specimens (n = 5, ages 30, 
41, 61, 74, 84 years). These observations indicate that, whereas the 
infant brain contains a robust RMS with massive chain migration, 
such activity is reduced markedly in older children and adults’. 
Although our data do not rule out that rare immature neurons may 
sporadically migrate within the subventricular zone and RMS at later 
stages””, these findings do not support the previous finding of robust 
proliferation and abundant migration within the adult human subven- 
tricular zone’. 
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The distal limb of the RMS delivers subventricular zone neuronal 
progeny to the human olfactory peduncle, and then to the olfactory 
tract and olfactory bulb. Serial cross-sections of the olfactory tract 
(n=6; 1 day; 1 week; 5, 6 and 8 months) revealed clusters of 
DCX* PSA-NCAM* immature neurons within the V-shaped central 
core (Fig. 3). Electron microscopy analysis of the 6-month olfactory 
tract (n = 3) confirmed ultrastructural features of migrating immature 
neurons within the core. Cross-sections of the proximal olfactory tract 
core (5 mm anterior to the olfactory peduncle) at 8 months contained 
234 + 26 total cells and 173 + 24 DCX” cells per section, respectively, 
while the distal olfactory tract core (5mm posterior to the olfactory 
bulb) contained 216 + 20 total cells and 115+ 14 DCX* cells. In 
contrast, DCX* neurons were not detected in older olfactory tract 
specimens (n = 3; 18 months, 7 and 13 years) (Fig. 3). These data 
further support that an active RMS exists in early childhood, but is 
greatly reduced after 18 months of age. 

Although it has been suggested that an open olfactory ventricle 
persists into adult life’, we found no evidence of a ventricular extension 
in the olfactory tract at any of the ages studied, consistent with previous 
data indicating that the olfactory ventricle fuses before birth”*”°. Along 
the proximal limb of the RMS, we observed ependymal islets, but no 
evidence of a continuous open ventricle. These displaced and discon- 
tinuous islets were lined by multiciliated cuboidal cells expressing the 
ciliary marker acetylated-tubulin (Supplementary Fig. 5). In all studied 
specimens (n = 23), the anterior horn of the lateral ventricle was open, 
but no continuous, ependymal-lined lumen extending into the RMS or 
olfactory tract was observed. Thus, we infer that the human olfactory 
tract serves as a conduit for neuronal chain migration to the olfactory 
bulb, but these chains of migratory cells are only evident in infants and 
occur in the absence of a ventricular extension. 

In tracing the ependymal islets of the paediatric human RMS with 
serial coronal reconstructions ( = 3), we noted a decrease in calibre of 
the RMS from the proximal to distal limb (Fig. 4). On the basis of 
quantification of 4',6-diamidino-2-phenylindole (DAPI) nuclear 
staining, the proximal limb of the RMS contained 548 + 66 total cells 


per cross-section and 485 + 58 DCX* immature neurons per cross- 
section. In the distal limb, however, there were 228 + 24 total cells per 
cross-section and 189+ 17 DCX* immature neurons per cross- 
section, equating to a 58% decline in total cells and a 61% decline in 
immature neurons. The decreasing calibre of the RMS could be due to 
cell death, increased migratory speed, or immature migratory neurons 
taking alternative paths. TdT-mediated dUTP nick end labelling 
(TUNEL)-positive cells were present in the proximal and distal 
RMS, raising the possibility that apoptosis could contribute to this 
decline. However, serial coronal reconstruction of the frontal lobe also 
unexpectedly revealed an additional migratory stream of DCX” cells 
branching off the proximal limb of the RMS and ending in the ven- 
tromedial prefrontal cortex (VMPFC) (Fig. 4). This medial migratory 
stream (MMS) was observed in human specimens aged 4-6 months 
but not 8-18 months. Similar to the RMS, the MMS contains large 
clusters of DCX* and PSA-NCAM*“ cells with elongated morphologies, 
some adjacent to discontinuous ependymal islets (Supplementary 
Fig. 5). Early time points (<1 month) also revealed a more diffuse pattern 
of medially oriented migratory neurons emanating from the proximal 
limb of the primary RMS (Supplementary Fig. 6). Interestingly, cells 
expressing DCX, PSA-NCAM and the interneuron markers calretinin 
(CalR) and tyrosine hydroxylase (TH), were observed not only within 
this MMS but also within a restricted subregion of the VMPFC (Fig. 4). 
In contrast, very few DCX*, PSA-NCAM“*, CalR* and TH* cells were 
evident in adjacent areas of prefrontal cortex. Although it remains 
possible that alterations in immunoreactivity allow progeny to escape 
into adjacent regions undetected, these observations suggest that the 
MMS diverts immature subventricular zone neurons to target the 
VMPEC. 

A comparable MMS has not been reported in other vertebrates. We 
analysed serial sections of mouse brains at postnatal day (P)4, P8, P16 
and P20 and did not detect a MMS (Supplementary Fig. 7). However, 
some individual DCX™ cells were observed migrating ventrally and 
laterally in juvenile brain studied that might target homologous brain 
regions. In rodents and non-human primates, migratory neurons can 
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Figure 3 | Postnatal development and decline of the RMS in the human 
olfactory tract. a, c—e, h-j, Hlustration (a) localizes the anatomical sites for 
cross-sectional reconstructions of the proximal human olfactory tract at 1 day 
(c,h), 8 months (d, i) and 7 years (e, j). LV, lateral ventricle; OB, olfactory bulb. 
High-magnification fields (h-j) demonstrate progressive loss of DCX* PSA- 
NCAM‘* immature neurons in the olfactory tract. An active RMS is evident at 
birth and 8 months of age, with many DCX* PSA-NCAM* cells. By 7 years, 
however, chains of migrating cells are not observed and DCX™ cells are rare. 
b, Toluidine blue staining of a longitudinal semi-thin (6-1m) section from a 


384 | NATURE | VOL 478 | 20 OCTOBER 2011 


6-month olfactory tract demonstrates a central chain of darkened immature 
neurons surrounded by a matrix of lighter-coloured astrocytes. f, g, Electron 
microscopy also demonstrates clusters of type A cells (labelled “A’) enmeshed 
with type B cells within the olfactory tract core, including intercellular junctions 
(yellow arrows) between immature neurons characteristic of chain migration 
(f) and intermediate filaments (yellow arrowheads) within surrounding type B 
cells (labelled ‘B’; g). Scale bars, 500 ttm (c-e); 75 [um (h-j); 10 um (b); and 
0.5 tum (f, g). 
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Figure 4| A MMS of immature neurons branches from the proximal RMS 
in the infant human brain to supply the VMPFC. a, b, Illustrations 
demonstrate the anatomical site of the RMS (blue) and MMS (red). F, frontal 
lobe; LV, lateral ventricle; OB, olfactory bulb; T, temporal lobe. c, Using cross- 
sections of the proximal and distal RMS, quantifications of the total number of 
cells and total number of DCX" cells per cross-section indicated a significant 
reduction from proximal to distal RMS. Asterisks indicate P< 0.05. Data are 
shown as mean + s.e.m. d, Coronal reconstruction of a 6-month specimen 
reveals a PSA-NCAM* MMS diverging from the RMS to reach the VMPEC. 
e, f, Within this MMS, chains of PSA-NCAM‘ cells (e) co-express the 
immature neuronal marker DCX (f). g-l, Dense clusters of PSA-NCAM~ 
(g), calretinin’ (i) and TH (k) cells are specifically observed within a 
subregion of the VMPFC, but not at adjacent cortical sites (h, j, 1) superior or 
inferior to this subregion. Scale bars, 150 um (d), 20 jum (e-). 


also escape the subventricular zone along a sagittal plane to reach the 
islands of Calleja*'”’. 

Our study indicates that the region of the subventricular zone 
around the anterior lateral ventricles in the infant human brain is 
highly active, producing many tangentially migrating immature neu- 
rons. On the basis of the presence of an RMS containing chains of 
immature neurons, we infer that at least some of these progeny are 
destined for the olfactory bulb. Beyond 18 months of age, both prolif- 
erative activity and cells expressing markers of immature neurons are 
largely depleted, coinciding with the appearance of a hypocellular gap 
in the postnatal human subventricular zone. Thus, this layer of the 
subventricular zone initially serves as a thoroughfare for immature 
neurons. Surprisingly, paediatric human subventricular zone neuro- 
genesis also seems to serve regions other than the olfactory bulb, as 
evidenced by medially escaping immature neurons near the RMS. 
These groups of cells form a unique MMS targeting a subregion of 
the human prefrontal cortex. 

Previous work has suggested that postnatal neurogenesis may be 
important for learning and memory~** and that induction of plasticity 
may be closely linked to the timing of neuronal maturation®”*’. We 
speculate that the MMS, and other potential escape pathways from 
the subventricular zone, could supply interneurons to regions of 
the developing human brain as a mechanism of delayed postnatal 
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plasticity. Although the function of this recipient cortical domain, the 
VMPEIC, is unknown in children, this region in the adult human brain 
is activated during specific cognitive tasks**”’ including spatial concep- 
tualization and the emotional processing of visual cues. Interestingly, 
the VMPFC is also focally inactivated in patients with advanced 
Alzheimer’s disease*®. Beyond its functional implications, this develop- 
mental study of the human subventricular zone suggests a major period 
of neurogenesis and neuronal migration that extends well into post- 
natal life, but is largely limited to early childhood. This may hold 
important implications for our understanding of neonatal neurological 
diseases, including germinal matrix haemorrhages and perinatal 
hypoxic-ischaemic injuries, each potentially altering subventricular 
zone neurogenesis and its apparent downstream cortical targets at 
formative stages of human development. Perhaps most importantly, 
the detection ofa new migratory route for immature neurons within the 
infant human brain also highlights mechanisms through which 
increased regional complexity may be achieved during brain evolution. 
Note added in proof: A recent study” has found the RMS at fetal stages, 
but very few DCX-positive neurons in this same region in the adult 
human brain. 


METHODS SUMMARY 


Human specimens. Neurosurgical excisions of normal subventricular zone 
occurred as part of the planned margin of resection surrounding a periventricular 
lesion (Supplementary Table 1). Intra-operative specimens were histologically 
normal with no evidence of dysplasia, and assessments were independently con- 
firmed by an independent neuropathologist. For pathological specimens, autopsied 
brains were cut coronally at the mammillary bodies and immersed in 4% para- 
formaldehyde (PFA) for 1-2 weeks at 4°C, and then stored in 0.1M PBS. All 
specimens were collected with informed consent and in accordance with the 
University of California San Francisco Committee on Human Research (IRB no. 
H11170-19113). 

Rodent specimens. Postnatal mice at the specified ages were transcardially per- 
fused with 0.9% saline and 4% PFA, and dissected brains were post-fixed for 30 min 
in 4% PFA before vibratome sectioning. One-hundred-micrometre floating sections 
were stained and imaged using the tile scanning and three-dimensional projection 
modules on a Leica SP5 confocal microscope. 

Immunohistochemistry. Tissue sections were incubated with primary antibodies 
diluted overnight at 4°C. Sections were then incubated for 2.5h in secondary 
antibodies and then incubated in streptavidin-horseradish peroxidase for 
30 min. Antigen retrieval with proteinase K (10 jg ml’), or 0.01 M citrate buffer 
at 95 °C was used when necessary. 

Whole-mount dissection. Specimens were fixed in 4% PFA for 3 days, then rinsed 
in 0.1M PBS. The anterior horn of the lateral ventricles was excised (3mm 
squares) and rinsed. Tissue specimens were incubated in primary antibodies 
diluted in blocking solution for 2 nights. This step was repeated for secondary 
antibodies incubation. The ventricular face of tissue specimens was microdissected 
and collected at 200-300 um thickness and mounted. 

Electron microscopy. Specimens fixed in 2% glutaraldehyde and 2% para- 
formaldehyde were cut into 200-mm sections on a vibratome. Sections were 
post-fixed in 2% osmium, rinsed, dehydrated and embedded in Araldite 
(Durcupan, Fluka). To identify individual cell types, ultra-thin (0.05-mm) sections 
were cut with a diamond knife, stained with lead citrate and examined under a Jeol 
100CX electron microscope. 

Statistical measures. Cell counts were quantified and expressed as mean + 
standard deviation (s.d.) or standard error of the mean (s.e.m.). Statistical signifi- 
cance was determined by Student’s t-test. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Human specimens. Neurosurgical excisions of normal subventricular zone 
occurred as part of the planned margin of resection surrounding a periventricular 
lesion (Supplementary Table 1). We recorded the anatomical origin of each intra- 
operative specimen with intra-operative neuronavigation. Intra-operative specimens 
were histologically normal with no evidence of dysplasia, and assessments were 
independently confirmed by an independent neuropathologist. 

For pathological specimens, autopsied brains were cut coronally at the level of 
the mammillary bodies and immersed in 4% paraformaldehyde (PFA) for 1-2 
weeks, and then stored in 0.1 M PBS. The brains were then cut into 1-cm plates 
along the coronal, axial or sagittal plane. The anterior horn of the lateral ventricle 
and the ventral medial aspect, which includes the gyrus rectus and olfactory tract, 
of the frontal cortex were excised. Autopsy specimens were obtained within 12h 
of death from all individuals. All causes of death were non-neurological in origin 
and all patients had no evidence of intracranial disease. All specimens were 
collected with informed consent and in accordance with the University of 
California San Francisco (UCSF) Committee on Human Research (IRB no. 
H11170-19113). 

Rodent specimens. Postnatal male mice at the specified ages were transcardially 
perfused with 0.9% saline and 4% PFA, and dissected brains were post-fixed for 
30 min in 4% PFA before vibratome sectioning. One-hundred-micrometre float- 
ing sections were stained with rabbit anti-doublecortin (Cell Signaling), mouse 
anti-PSA-NCAM (Millipore) and DAPI (Sigma), mounted on Superfrost Plus 
slides, and imaged using the tile scanning and three-dimensional projection 
modules on a Leica SP5 confocal microscope. Composite images were assembled 
from individual X10 fields using Adobe Photoshop. All experiments were 
approved by the UCSF Institutional Animal Care and Use Committee (approval 
no. ANO77716). 

Immunohistochemistry. All fixed specimens were rinsed in 0.1 M PBS and then 
cut on a vibratome (50 jum), or cryoprotected in 30% sucrose and then cut on a 
cryostat (30 [um). Tissue sections were incubated with primary antibodies diluted 
in TNB blocking solution (0.1 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.5% blocking 
reagent from PerkinElmer) overnight at 4°C. Sections were then incubated for 
2.5h in secondary antibodies diluted in TNB. Sections were then incubated in 
streptavidin-horseradish peroxidase for 30min in TNB, which catalyses sub- 
sequent fluorescent conversion of tyramide substrates (all from PerkinElmer). 
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Antigen retrieval with proteinase K (10,gml ‘) or 0.01M citrate buffer at 
95 °C was used when necessary. 

The following antibodies were used in this study: GFAP (Chemicon), vimentin 
(Sigma), doublecortin (Chemicon), TuJ1 (Covance), PSA-NCAM (Genbiosys), 
Ki67 (DAKO), calretinin (Chemicon), calbindin (Chemicon), tyrosine hydroxylase 
(Chemicon) and EGFR (Upstate Biotechnology). 

Whole-mount dissection. Specimens were fixed in 4% PFA for 3 days, then rinsed 
in 0.1 M PBS. The anterior horn of the lateral ventricles was excised (3 mm squares) 
and rinsed in 0.1% PBS-Triton X-100. Tissue specimens were incubated in primary 
antibodies diluted in blocking solution (10% normal goat serum, 2% Triton X-100, 
0.1 M PBS) for 2 nights. This step is repeated for secondary antibodies incubation. 
DAPI was used (1:500) for counterstaining and Aqua Polymount (Polysciences) 
for mounting. The ventricular face of tissue specimens is microdissected and 
collected at 200-300 um thickness and mounted. 

In situ hybridization. Probes specific to human GFAP, doublecortin and EGFR 
were generated by amplifying fragments from commercially available cDNA 
clones (Open Biosystems) or reverse-transcribed cDNA from human fetal brain 
total RNA (Clontech). Primer sequences for amplifying GFAP (Allen Human 
Brain Atlas primer ID398603) and doublecortin (primer ID399287) were obtained 
from Allen Institute for Brain Science (http://www.brain-map.org/). Forward 
primer 5’-AGCTCTTCGGGGAGCAGCGA-3’ and reverse primer 5'-TGCAC 
GTGGCTTCGTCTCGG-3’ were used for EGFR. Amplified fragments were sub- 
cloned into pCRII (Invitrogen) and RNA probes were made subsequently using 
DIG RNA labelling (Roche). Sections were cut at 14 um, mounted on Superfrost 
Plus slides. The sections were pre-hybridized for 2 h and hybridized with probes at 
1:100 to 1:200 dilutions at 55 °C overnight. After hybridization, slides were washed 
and incubated with anti-DIG antibody, and developed by BM purple substrate for 
24 to 48h. Slides hybridization with GFAP and doublecortin RNA probes were 
subsequently immunostained with GFAP and doublecortin antibodies as previ- 
ously described. 

Electron microscopy. Specimens fixed in 2% glutaraldehyde and 2% paraformalde- 
hyde were cut into 200-mm sections on a vibratome. Sections were post-fixed in 2% 
osmium, rinsed, dehydrated and embedded in Araldite (Durcupan, Fluka). To study 
subventricular zone architecture, we cut serial 1-mm semi-thin sections and stained 
them with 1% toluidine blue. To identify individual cell types, ultra-thin (0.05-mm) 
sections were cut with a diamond knife, stained with lead citrate and examined under 
a Jeol 100CX electron microscope. 
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Primary motor cortex underlies multi-joint 
integration for fast feedback control 


J. Andrew Pruszynski!”*, Isaac Kurtzer>**, J oseph Y. Nashed!, Mohsen Omrani!, Brenda Brouwer!* & Stephen H. Scott!>® 


A basic difficulty for the nervous system is integrating locally 
ambiguous sensory information to form accurate perceptions 
about the outside world’~*. This local-to-global problem is also 
fundamental to motor control of the arm, because complex mech- 
anical interactions between shoulder and elbow allow a particular 
amount of motion at one joint to arise from an infinite combination 
of shoulder and elbow torques’. Here we show, in humans and rhesus 
monkeys, that a transcortical pathway through primary motor cor- 
tex (M1) resolves this ambiguity during fast feedback control. We 
demonstrate that single M1 neurons of behaving monkeys can integ- 
rate shoulder and elbow motion information into motor commands 
that appropriately counter the underlying torque within about 50 
milliseconds of a mechanical perturbation. Moreover, we reveal a 
causal link between M1 processing and multi-joint integration in 
humans by showing that shoulder muscle responses occurring ~50 
milliseconds after pure elbow displacement can be potentiated by 
transcranial magnetic stimulation. Taken together, our results show 
that transcortical processing through M1 permits feedback res- 
ponses to express a level of sophistication that rivals voluntary con- 
trol; this provides neurophysiological support for influential 
theories positing that voluntary movement is generated by the 
intelligent manipulation of sensory feedback’. 

Extensive research has shown that some of our fastest motor re- 
actions express a degree of sophistication that rivals voluntary 
actions*”°, but little is known about the neural substrates that underlie 
this sophistication’'. The present monkey and human studies test the 
hypothesis that primary motor cortex (M1) provides a neural substrate 
for integrating shoulder and elbow motion information for fast feedback 
control, a key ability for generating fast and accurate corrections*'*”. 
Misa prime candidate to mediate this ability because: (1) it forms part 
of a transcortical feedback pathway, giving it access to the required 
afferent information’*"*; (2) it is a key node for voluntary control, which 
appropriately incorporates shoulder and elbow information when 
generating commands for voluntary actions'*’’; and (3) influential 
theories posit that voluntary movement involves the sophisticated 
manipulation of sensory information’, suggesting substantial functional 
and anatomical overlap between voluntary and feedback control’. 

We first determined whether individual neurons in monkey M1 
exhibit a pattern of activity consistent with multi-joint motion integ- 
ration during fast feedback control. Two male rhesus monkeys were 
trained to counter unpredictable step-torque perturbations applied at 
the shoulder and/or elbow which displaced their hand from a central 
target. To receive water reward, the monkeys needed to return their 
hand to the target within 750 ms and remain within it for an additional 
3 s, allowing us to analyse both fast feedback responses (<100 ms post- 
perturbation) and steady-state motor outputs (last 1 s of stabilization) 
produced in response to the applied torque within the same trial. 

Because our experimental scheme (Fig. 1a) was specifically designed 
to examine sensorimotor control of the shoulder joint, we were 


principally interested in neurons whose steady-state motor outputs 
varied with the exerted shoulder torque (that is, shoulder-like neu- 
rons). As in our previous studies, we found that the population of 
neurons was biased towards combined shoulder and elbow torques 
(Rayleigh test for bimodality, P< 0.05), making shoulder-like neurons 
relatively rare’’. In total, 25 of 356 M1 neurons were categorized as 
shoulder-like because they exhibited significant directional tuning to 
steady-state loads (plane-fit, P< 0.05) and a preferred torque direction 
within 15° of either shoulder flexion or shoulder extension torque 
(Supplementary Fig. 1). 

The key question is how quickly shoulder-like neurons become 
selectively tuned to shoulder torque following an unexpected torque 
perturbation. This is not a mere restatement of our selection criteria 
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Figure 1 | Experimental methods. a, Because of the mechanical properties of 
the limb, an infinite combination of possible shoulder and elbow torque 
perturbations can cause the same resultant shoulder motion. Generating the 
shoulder motor response which counters the applied shoulder torque 
perturbation requires integrating elbow information. b, Limb configuration 
before (unfilled) and after (filled) a torque perturbation was applied at either the 
shoulder (t,) or elbow (t,) as in experiment 1. Opposite conditions (shoulder- 
extensor / elbow-flexor torque) not shown. c, Joint displacement resulting from 
the shoulder (red) and elbow (blue) perturbation conditions in b. The 
perturbations yielded similar shoulder motion but substantially different elbow 
motion. Solid lines represent the mean displacements and the grey lines show 
individual trials. d, Limb configuration before and after a multi-joint flexion or 
multi-joint extension torque perturbation as in experiment 2. e, The 
perturbations shown in d caused substantial elbow motion but almost no 
shoulder motion. 
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because local shoulder information is sufficient for countering the 
underlying shoulder torque in the steady-state. In contrast, the only 
way that fast feedback responses can account for the ambiguous rela- 
tionship between local joint motion and global torque is by integrating 
information from both the shoulder and elbow. 

The need to resolve ambiguous motion information for fast feed- 
back control is exemplified in our first experiment, where we applied 
either shoulder torque or elbow torque perturbations (Fig. 1b). These 
perturbations caused substantially different amounts of elbow motion 
but nearly identical shoulder motion (Fig. 1c). If shoulder-like neurons 
integrate both shoulder and elbow motion information, then they 
should differentiate between the two conditions and respond more 
strongly to the shoulder torque perturbation than the elbow torque 
perturbation. Figure 2a presents an exemplar neuron which follows 
this pattern. It was maximally active during steady-state compensation 
of shoulder-extension torque, and it responded more strongly to the 
shoulder-extension torque perturbation than the elbow-flexion torque 
perturbation (t-test, tg = 2.2, P<0.05) within ~60 ms of perturba- 
tion onset. Moreover, the population of shoulder-like neurons (Fig. 2b, 
Supplementary Fig. 2a) also quickly expressed greater activity for 
shoulder torque perturbations than elbow torque perturbations 
(paired t-test, to4 = 2.7, P< 0.01; 15 out of 20 neurons in monkey P 
and 4 out of 5 neurons in monkey X show the expected trend). 

Our hypothesis makes the additional prediction that different 
amounts of inhibition should occur to torque perturbations that are 
opposite to a neuron’s steady-state preference. That is, a neuron 
which is maximally active during steady-state compensation of 
shoulder-extension torque should quickly express more inhibition to 
shoulder-flexion torque perturbations than to elbow-extension torque 
perturbations. This prediction was verified across the population 
(paired t-test, t,4 = 2.1, P< 0.05), demonstrating that shoulder-like 
neurons possess a pattern of multi-joint integration appropriate for 
both excitatory and inhibitory perturbations (Supplementary Fig. 2a). 
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Figure 2 | Neurons in primary motor cortex. a, Responses of an exemplar 
shoulder-like neuron to either an elbow (blue) or shoulder (red) torque 
perturbation in experiment 1 where shoulder or elbow torque perturbations 
yielded the same shoulder motion. Data aligned on perturbation onset. Dots 
represent single action potentials and the trace depicts the average response. 
b, Same format as a but representing the population response across shoulder- 
like neurons. The arrows indicate when neural activity is non-specific to the 
underlying shoulder torque perturbation and then when it begins to respond 
appropriately to the underlying shoulder torque. c, d, Same format as a and 
b but for experiment 2 where combined shoulder and elbow torque 
perturbations yielded no shoulder motion. 
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Another situation where the nervous system must resolve locally 
ambiguous information is exemplified in our second experiment, in 
which torque perturbations at both the shoulder and elbow cause 
substantial elbow motion but no shoulder motion (Fig. 1d,e). If fast 
feedback responses of shoulder-like neurons appropriately integrate 
shoulder and elbow motion to counter the underlying torque then they 
should respond to this perturbation, even though the shoulder joint is 
not displaced and no local shoulder sensor (in the muscle, joint or skin) 
can signal the event. Indeed, the exemplar neuron (Fig. 2c) increased 
its activity within ~60 ms of pure elbow extension motion, which is 
appropriate for countering the underlying shoulder extensor torque 
and consistent with its response in our first experiment (f-test, 
tig =5.1, P<10 *). The population of shoulder-like neurons also 
quickly expressed the predicted response pattern (Fig. 2d, Supplemen- 
tary Fig. 2b; paired t-test, th4 = 4.4, P< 10°; 15 out of 20 neurons in 
monkey P and 4 out of 5 neurons in monkey X show the expected 
trend). 

The above analysis established that M1 neurons integrate shoulder 
and elbow motion to counter the underlying torque perturbation 
within a binned epoch from 50 to 100 ms after perturbation onset 
(Fig. 3a). The response of shoulder-like neurons in this epoch 
paralleled the response of monkey shoulder muscles in the same epoch 
(Fig. 3b), suggesting that M1 contributes to the observed muscle activity. 
To provide further evidence of a functional link, we calculated the 
temporal evolution of multi-joint integration for both neurons and 
muscles using a receiver operating characteristic (ROC) analysis”. 
We found that multi-joint integration occurred in the population of 
MI neurons 8-20 ms before it occurred in muscles (Fig. 3c). Although 
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Figure 3 | Population analysis of neurons and muscles. a, Binned response 
(50-100 ms post-perturbation) across shoulder-like neurons. For Experiment 1, 
the red and blue bars represent responses to shoulder and elbow torque 
perturbations, respectively. For Experiment 2, the red and blue bars depict 
responses to pure elbow motion caused by a torque perturbation aligned with or 
opposite to the neuron’s steady-state preference, respectively. Error bars, s.e.m.; 
*significant differences between conditions (paired t-test, P< 0.05); 
tsignificant differences from baseline; NS, not significant. b, Same format as 
a but for the population of muscles. Because of the normalization procedure, 
muscle baseline activity is 0 arbitrary units (a.u.). c, Average discrimination 
probability over time for the population of neurons and muscles as determined 
by ROC analysis. Conditions are collapsed across experiments such that the 
vertical axis is a metric of multi-joint integration. On average, the neurons led 
the muscles by ~19 ms as estimated by the mean temporal difference between 
the neural and muscle ROC curves (AT) from 50 to 100 ms post-perturbation 
(shaded area). d, Same format as Fig. 2d but for elbow-like neurons. 
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this temporal lead is substantially shorter than typical measurements of 
M1-to-muscle latency at the initiation of voluntary movement 
(~60 ms)”, it is consistent with the known conduction delay between 
MI neurons and muscles of the monkey upper-limb*”’. 

Interestingly, shoulder-like M1 neurons did not immediately 
account for the limb’s mechanical properties, displaying a non-specific 
response to the torque perturbations from ~20 to 50 ms following 
perturbation onset (Fig. 2b, d). A similar non-specific response was 
observed across our whole population of M1 neurons (Supplementary 
Fig. 2c,d). For example, neurons which preferentially responded to 
steady-state elbow torque (elbow-like neurons, Supplementary Fig. 1) 
initially exhibited the same response whether the perturbation flexed or 
extended the elbow joint as in our second experiment (Fig. 3d). This 
non-specific response is strikingly similar to a population of neurons in 
primary visual cortex (V1), which initially respond ambiguously to 
objects placed in their receptive field and become sensitive to motion 
direction only after 20-30 ms (ref. 2), a delay attributed to interactions 
among V1 neurons”. The temporal evolution of multi-joint integration 
that we observe may also reflect processing intrinsic to M1 or it may be 
caused by delayed contributions from other neural structures, such as 
somatosensory cortex and cerebellum; this is an important issue that 
warrants further investigation. 

Although the activity of single neurons in monkeys provides 
evidence that M1 is functionally linked to multi-joint integration for 
fast feedback control, the above data are ultimately correlational and 
cannot establish whether M1 causes the co-varying pattern of shoulder 
muscle activity. We addressed this issue by directly influencing the 
processing of M1 in human participants while they generated feedback 
corrections similar to those in the monkey study. Applying a single 
pulse of transcranial magnetic stimulation (TMS) over M1 will excite 
its intrinsic circuits and, thereby, evoke a burst of muscle activity. 
When TMS is applied in conjunction with a joint perturbation, the 
response in the stretched muscle is much larger than the linear sum of 
the response to TMS alone and the perturbation alone****. These 
supra-linear effects—previously demonstrated in the finger, wrist 
and elbow muscles—occur only when TMS is timed to evoke a res- 
ponse >50 ms after the perturbation, suggesting that the two stimuli 
interact through a common cortical circuit and that feedback control 
at latencies >50 ms reflects processing in M1. We established the 
validity of this technique for shoulder muscles, as a supra-linear res- 
ponse occurred when TMS was delivered ~65 ms after the shoulder 
muscle was stretched (that is, during the long-latency reflex; t-test; 
extensor: to = 6.7, P< 10°; flexor: tg = 6.0, P< 10 °) but not 
25 ms after the muscle was stretched (that is, during the short-latency 
reflex; extensor: ff = —0.5, P>0.5; flexor: tg =0.5, P>0.5), when 
only spinal processes could contribute (Supplementary Fig. 3, left 
column). 

The critical question is whether M1 causally contributes to multi-joint 
integration for fast feedback control. We tested this hypothesis by apply- 
ing TMS in conjunction with the torque perturbation that causes pure 
elbow displacement (Fig. 1d, e). Any supra-linearity of the shoulder 
muscle response in this condition must reflect afferent information from 
the elbow joint onto cortical circuits controlling shoulder muscles, 
because local shoulder afferents are not physically affected by pure 
elbow motion. The predicted supra-linear effect was observed for both 
shoulder flexors and extensors (Fig. 4, Supplementary Fig. 3, right 
column) with TMS delivered 65 ms after perturbation onset (extensor: 
ty = 3.8, P< 0.01; flexor: tg = 5.3, P<10 °) but not 25 ms after per- 
turbation onset (extensor: tp = —2.7, P>0.5; flexor: tg = —0.1, 
P>0.5). The observed supra-linearity probably reflects latency- 
specific engagement of M1 rather than a general change in motor 
neuron excitability, because we found no correlation between the mag- 
nitude of perturbation-evoked activity and the amount of supra- 
linearity at either latency (P>0.1, Supplementary Fig. 4). Taken 
together, these results provide strong evidence that M1 causally under- 
lies multi-joint integration for fast feedback control. 
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Figure 4 | TMS and perturbation evoked activity in human shoulder 
muscles. a, Response evoked in an exemplar shoulder muscle (posterior 
deltoid) when a mechanical perturbation or TMS was applied in isolation. Note 
that the mechanical perturbation yielded a robust long-latency reflex and that 
the TMS yielded substantial evoked activity. b, Observed response (orange) and 
linear prediction (sum of responses in a, black) when the mechanical 
perturbation and TMS were applied in the same trial. c, Group muscle response 
(mean + s.d.) when TMS was paired with the perturbation normalized by the 
sum of their separate effects (Enorm = Erms,pert/ (Erms + Epert). Values above 1 
indicate supra-linearity; *significant supra-linearity (t-test, P< 0.05). 


Previous studies have demonstrated that fast feedback responses in 
Mare scaled by task-constraints, such as movement amplitude”, sur- 
face texture” and intended vigour'*”*. Our results show that M1 also 
integrates locally-ambiguous motion information into a global response 
that accounts for the limb’s mechanical properties, a more complex 
capability that is central to successfully guiding whole-arm move- 
ments”. It is well established that the voluntary motor system accounts 
for the mechanical properties of the limb and that this capability is 
expressed in the activity of M1 neurons”. We have previously argued 
that the functional similarity of voluntary and feedback control is not 
an accident and probably arises because of a common neural imple- 
mentation that includes M1’. This expectation is consistent with recent 
theories of sensorimotor control, which posit that voluntary behaviour 
involves the sophisticated manipulation of sensory information’. If our 
suggestion is true, then feedback processing in M1 should possess all 
the capabilities of voluntary processing in M1 and, likewise, studying 
feedback processing may provide a useful window into voluntary 
control. 


METHODS SUMMARY 


Studies were approved by the Queen’s University Research Ethics Board and 
Animal Care Committee. Ten human subjects gave informed consent and com- 
pleted the experiments as previously described*. Two monkeys (Macaca mulatta, 
~10kg) performed a similar experimental scheme in a miniaturized version of the 
same apparatus with ~10X smaller loads (KINARM, BKIN Technologies)”. 
Monkeys did not counter a pre-perturbation background load, and were exposed 
to eight randomly-interleaved step-torque perturbations, four used in experiment 1 
and two used in experiment 2. All eight conditions were used to calculate steady- 
state tuning by performing a planar regression on the neural activity when the 
monkey had re-stabilized its hand at the central target'®. 

Neural recordings were performed with single electrodes and processed accord- 
ing to standard techniques'*”’. Monkey muscle activity was acquired from mono- 
articular shoulder muscles (anterior/middle/posterior deltoid, pectoralis major; 
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n= 34) using fine-wire electrodes. Human experiments used surface electrodes 
(posterior deltoid, pectoralis major; n = 19). Population responses for both neurons 
and muscles were collapsed across shoulder flexion and extension conditions 
according to their predicted excitatory and inhibitory effects (for example, shoulder- 
flexion torque perturbations were excitatory for shoulder extensor muscles/neurons 
and inhibitory for shoulder flexor muscles/neurons). 

Single pulses of TMS (MES-10, Cadwell) were applied over left M1 with a 
posterior orientation of 30-45°. Placement/orientation of the double coil was 
chosen to evoke the largest response from the muscle of interest, ~4.5 cm lateral 
from vertex. Stimulation magnitude (40-50% of maximum) was selected to deliver 
the smallest-possible consistent response (evoked activity on seven consecutive 
trials) when the shoulder muscle countered a steady-state load. TMS only, per- 
turbation only and combined TMS and perturbation trials were randomly inter- 
leaved. TMS was timed to evoke shoulder muscle activity either ~25 ms or ~65 ms 
after perturbation onset. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Participants and apparatus. The studies presented were approved by the Queen’s 
University Research Ethics Board. All monkey ( = 2, Macaca mulatta, ~10kg, 
male) procedures were approved by the Queen’s University Animal Care 
Committee. Human subjects (6 females, 4 males, median age = 27) were neuro- 
logically unimpaired, had normal/corrected vision and provided informed con- 
sent. Human and monkey experiments were performed using different versions of 
the same robotic exoskeleton (KINARM, BKIN Technologies) which allows com- 
bined flexion and extension movements of the shoulder and elbow in the hori- 
zontal plane and can independently apply mechanical loads to the shoulder and/or 
elbow”. Target lights and simulated hand feedback were presented in the hori- 
zontal plane of the task via a virtual reality display, and direct vision of the hand 
was limited either by a physical barrier (humans) or by a lack of ambient light 
(monkeys). 

Neural, muscle and kinematic recordings. Recording chambers were surgically 
implanted under inhalation anaesthetic, and neural recordings were performed 
according to standard techniques'*”’. Single tungsten microelectrodes (FHC) were 
advanced until neural activity was observed. Individual neurons were then isolated 
and neural activity was recorded from those neurons with clear responses to either 
passive or active movements of the shoulder and/or elbow. Neurons which primarily 
responded to motion of the wrist or fingers were not recorded. Neurons recorded in 
the task (n = 356) were located in the rostral bank of the central sulcus as well as 
more superficial sites where previous mapping efforts showed that trains of electrical 
stimulation (11 pulses, 333 Hz, 0.2 ms pulse width, <50 A) could elicit shoulder 
and/or elbow movement. Post-mortem histology confirmed that recording sites 
from monkey P were located in M1. 

Monkey muscle activity was acquired from mono-articular shoulder muscles 
(anterior deltoid, middle deltoid, posterior deltoid, pectoralis major; n = 34) using 
fine-wire electrodes. Electrodes consisted of two single-strand wires and were 
individually inserted into the muscle belly spaced ~5 mm apart. Insertion was 
guided by anatomical landmarks and was confirmed by microstimulation. Human 
experiments used surface electrodes (Bortec AMT-8) and focused on those mono- 
articular muscles which could be readily recorded from the surface (posterior 
deltoid, pectoralis major; n = 19). Muscle activity was recorded at either 4 kHz 
(monkey) or 1 kHz (human), aligned on perturbation onset and full-wave rectified 
before analysis’. Only those muscles with clear phasic responses to the mechanical 
perturbation were analysed. Kinematic data and applied torques were acquired 
directly from the KINARM device and were sampled at the same rate as the muscle 
activity. 

Transcranial magnetic stimulation. We followed standard procedures in the 
TMS portion of the study****. Single pulses of TMS (MES-10, Cadwell) were 
applied over left M1 with a posterior orientation of 30-45°. Placement and orienta- 
tion of the double cone coil were chosen to evoke the largest response from the 
muscle of interest, ~4.5 cm lateral from vertex. Stimulation magnitude was selected 
to deliver the smallest-possible consistent response (evoked response on seven 
consecutive stimulations, average of 40% and 51% of the stimulator’s maximum 
output for the posterior deltoid and pectoral major, respectively) when the muscle 
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of interest actively countered a 3 N m background load (that is, active motor thresh- 
old). 

Experimental scheme. The experimental procedure and logic have been previ- 
ously described*. The major difference in the human portion of this study was the 
parallel implementation of TMS. Briefly, subjects stabilized their hand in a small 
central target while countering a steady state shoulder torque (3 Nm) which 
activated either the shoulder flexor or extensor muscles. After a random hold time 
(1-4), an unpredictable torque pulse (100 ms duration) was introduced and the 
trial ended when the subjects re-stabilized in the target for 500 ms. 

In total, four torque perturbations were used in the human study. Two single- 
joint torque perturbations (3 Nm shoulder-flexion for shoulder flexor muscles 
and 3N m shoulder-extension for shoulder extensor muscles) made up experi- 
ment 1, and two multi-joint torque perturbations (3 N m shoulder-flexion/3 N m 
elbow-flexion for shoulder flexor muscles; 3Nm_ shoulder-extension/3 Nm 
elbow-extension for shoulder extensor muscles) made up experiment 2. 
Perturbation only, TMS only and combined TMS and perturbation trials were 
randomly interleaved. In combined TMS and perturbation trials, the TMS was 
timed to evoke shoulder muscle activity either ~25 ms or ~65 ms after perturba- 
tion onset. 

The perturbation, TMS placement and TMS intensity were chosen for the 
shoulder flexor muscle and shoulder extensor muscle in two successive blocks. 
Half the subjects began with the conditions for the shoulder flexor muscle and half 
the subjects began with the conditions for the shoulder extensor muscle. Thirty 
repeats of the 14 conditions were collected for a total of 420 trials in a session that 
lasted about 2.5 h. 

Monkeys performed a similar scheme with ~10X smaller loads. Unlike 
humans, the monkeys did not counter a pre-perturbation background load and 
they were exposed to eight randomly-interleaved step-torque perturbations 
({shoulder torque, elbow torque], applied flexion/extension = positive/negative: 
1. [0.28Nm, ONm], 2. [0.24, 0.24], 3. [0, 0.24], 4. [—0.2, 0.2], 5. [—0.28, O], 6. 
[-0.24, —0.24], 7. [0, —0.24], 8. [0.2, —0.2]) and catch trials where no perturba- 
tions occurred. Four of these perturbations (1,3,5,7) formed experiment 1 and two 
(2,6) formed experiment 2. All eight conditions were used to calculate the steady- 
state tuning of each neuron by performing a planar regression on the neural 
activity when the monkey had re-stabilized its hand at the central target'*. To 
receive water reward, the monkeys needed to return their hand to the target within 
750 ms and remain within it for an additional 3 s, allowing us to analyse both fast 
feedback responses (<100 ms post-perturbation) and steady-state motor outputs 
(last 1 s of stabilization) to the applied torque within the same trial. Five to twenty 
repeats were collected per experimental condition. 

Population responses for both muscles and neurons were calculated by collaps- 
ing across shoulder flexion and extension conditions according to their predicted 
excitatory and inhibitory effects. That is, applied shoulder-flexion torque pertur- 
bations were excitatory for shoulder extensor muscles/neurons and inhibitory for 
shoulder flexor muscles/neurons. Applied shoulder-extension torque perturba- 
tions were excitatory for shoulder flexor muscles/neurons and inhibitory for 
shoulder extensor muscles/neurons. 
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Human induced pluripotent stem cells (iPSCs) represent a unique 
opportunity for regenerative medicine because they offer the pro- 
spect of generating unlimited quantities of cells for autologous 
transplantation, with potential application in treatments for a 
broad range of disorders'*. However, the use of human iPSCs in 
the context of genetically inherited human disease will require the 
correction of disease-causing mutations in a manner that is fully 
compatible with clinical applications**. The methods currently 
available, such as homologous recombination, lack the necessary 
efficiency and also leave residual sequences in the targeted genome’. 
Therefore, the development of new approaches to edit the mam- 
malian genome is a prerequisite to delivering the clinical promise of 
human iPSCs. Here we show that a combination of zinc finger 
nucleases (ZFNs)’ and piggyBac*” technology in human iPSCs 
can achieve biallelic correction of a point mutation (Glu342Lys) 
in the a,-antitrypsin (A1AT, also known as SERPINA1) gene that 
is responsible for a,-antitrypsin deficiency. Genetic correction of 
human iPSCs restored the structure and function of A1AT in sub- 
sequently derived liver cells in vitro and in vivo. This approach is 
significantly more efficient than any other gene-targeting tech- 
nology that is currently available and crucially prevents contamina- 
tion of the host genome with residual non-human sequences. Our 
results provide the first proof of principle, to our knowledge, for the 
potential of combining human iPSCs with genetic correction to 
generate clinically relevant cells for autologous cell-based therapies. 

At present, available methods for gene targeting rely on positive 
selection to isolate rare clones that have undergone homologous 
recombination. To remove the unwanted selection cassettes, Cre/ 
loxP or Flp/FRT recombination systems are used, which leave behind 
single loxP or FRT sites'®"’. These small ectopic sequences have the 
potential to interfere with transcriptional regulatory elements of sur- 
rounding genes’*, most of which are not fully characterized in the 
human genome. An alternative method to remove selection cassettes 
is to convert them into transposons. The most suitable transposon for 
this purpose is piggyBac, a moth-derived DNA transposon, which can 
transpose efficiently in mammalian cells including human embryonic 
stem (ES) cells”’*. A remarkable feature of this mobile element is 
seamless excision, which enables the removal of transgenes flanked 
by piggyBac inverted repeats without leaving any residual sequences”. 

To explore the use of piggyBac for the correction of point mutations, 
we designed a vector to correct an albino mutation (G290T sub- 
stitution in the Tyr gene) in mouse iPSCs isolated from fibroblasts 
of the C57Bl6-Tyr®®” strain'®. The targeting vector was constructed, 
carrying a wild-type 290G sequence and a PGK-puroAtk cassette 


flanked by piggyBac repeats into the TTAA site (Fig. 1a). After isolation 
of targeted clones, the selection cassette was excised from the mouse 
iPSC genome by transient expression of the piggyBac transposase and 
subsequent 1-(2-deoxy-2-fluoro-f-D-arabinofuranosyl)-5-iodouracil 
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Figure 1 | Correction of the G290T mutation in the Tyr gene in mouse 
iPSCs. a, The strategy for precise genome modification using the piggyBac 
transposon. Top line, structure of the Tyr gene; red line, 5’ external probe for 
Southern blot analysis; open arrow, piggyBac transposon carrying a PGK- 
puroAtk cassette; B, BamHI; E, EcoNI; P1, P2 and P3, PCR primers. 

b, c, Southern blot (b) and PCR analyses (c) showing insertion (c/PB) and 
excision (c/Rev) of the piggyBac transposon. ES, mouse ES cells as a control. 
d, e, Sequence analyses revealed correction of the G290T mutation (d) and 
seamless excision of the piggyBac transposon (e). Note that two silent mutations 
(A and T, indicated by arrowheads) introduced near the TTAA site were also 
detected. f, A chimaeric mouse generated by injecting corrected Tyr “**” mouse 
iPSCs (left) shows black coat colour. Right, a non-injected albino mouse. 
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(FIAU) selection. Genomic modification was verified by Southern blot 
and polymerase chain reaction (PCR) analyses (Fig. 1b, c). The cor- 
rection of the G290T mutation and seamless piggyBac excision were 
confirmed by sequence analyses (Fig. 1d, e). Two introduced silent 
mutations were observed, confirming that the T290G substitution 
was mediated by gene correction, not by spontaneous reversion 
(Fig. le). The function of the reverted allele was tested by injecting 
the corrected mouse iPSCs into albino mouse blastocysts. The result- 
ing chimaeric mice had a black coat colour, indicating phenotypic 
correction of the albino mutation (Fig. 1f). These results collectively 
demonstrate that the piggyBac transposon can be used as a versatile 
tool for highly precise modification (for example, correction or muta- 
tion) of the mammalian genome at a single base-pair level. 

We next explored whether this approach could be used to correct a 
mutation in human iPSCs derived from individuals with «-antitrypsin 
deficiency (ALATD)’®. A1LATD is an autosomal recessive disorder 
found in 1 out of 2,000 individuals of North European descent and 
represents the most common inherited metabolic disease of the 
liver'”!®. It results from a single point mutation in the A1AT gene 
(the Z allele; Glu342Lys) that causes the protein to form ordered 
polymers within the endoplasmic reticulum of hepatocytes'”"*. The 
resulting inclusions cause cirrhosis for which the only current therapy 
is liver transplantation. The increasing shortage of donors and harmful 
effects of inmunosuppressive treatments impose major limitations on 
organ transplantation, making the potential of human iPSC-based 
therapy highly attractive. Because homologous recombination is rela- 
tively inefficient in human ES cells° we used ZFN technology, which 
stimulates gene targeting in human ES cells as well as human 
iPSCs”'°”. ZEN pairs were designed to specifically cleave the site of 
the Z mutation (Fig. 2a—c, Supplementary Table 1 and Supplementary 
Note). A targeting vector was constructed from isogenic DNA with 
piggyBac repeats flanking the PGK-puroAtk cassette (Fig. 2a). To 
minimize the distance between the mutation and the piggyBac trans- 
poson, a CTG leucine codon, 10-bp upstream of the mutation, was 
altered to a TTA leucine codon, generating the TTAA sequence, which 
would be left in the genome following piggyBac excision (Fig. 2b). 

Puromycin-resistant human iPSC colonies obtained after co- 
electroporation of ZFN expression vectors and the targeting vector 
were screened for targeted clones by PCR. AlATD-iPSC lines derived 
from three different patients yielded targeted clones (Table 1). 
Remarkably, 54% of the puromycin-resistant colonies were targeted 
on one allele, whereas 4% were the result of simultaneous targeting of 
both alleles (Supplementary Fig. 1). 

To remove the piggyBac-flanked selection cassette from these 
modified clones, we transiently transfected two homozygously targeted 
clones (B-16 and C-G4) with a hyperactive form of the pigeyBac trans- 
posase® and subjected them to FIAU selection. The genotype of the 
resulting FIAU-resistant colonies was analysed by PCR and confirmed 
by Southern blot (Fig. 2d and Supplementary Fig. 2a). Biallelic excision 
was observed in 11% of FIAU-resistant colonies (Table 2). Sequence 
analyses demonstrated that the Z mutation was corrected on both alleles 
and that transposon excision yielded a TTAA sequence as initially 
planned (Fig. 2b, e and Supplementary Fig. 2b). The resulting corrected 
iPSC lines maintained the expression of pluripotency markers for more 
than 20 passages and their abilities to differentiate into cells expressing 
markers of the three germ layers (Supplementary Fig. 3), indicating that 
genome modification did not alter the pluripotency of corrected human 
iPSCs. 

Genomic instability is known to be associated with prolonged culture 
of human ES cells*°" and mutations arising during genome modifica- 
tion would be another concern for clinical application of human iPSCs. 
Therefore, we analysed the genomic integrity of the human iPSC lines 
using comparative genomic hybridization (CGH) (Supplementary 
Table 2a—c). Two out of three ALATD-iPSC primary lines differed from 
their parental fibroblasts, showing amplifications or deletions rang- 
ing from 20kb to 1.3 Mb, including a gain of 20q11.21, a frequently 
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Figure 2 | Correction of the Z mutation in human A1ATD-iPSCs. a, The 
strategy for precise genome modification using ZFNs and the piggyBac 
transposon. Top line, structure of the A1AT gene; blue lines, Southern blot 
probes; thin and thick boxes, non-coding and coding exons, respectively; open 
arrow, piggyBac transposon; A, AflIII; B, BamHI. b, Sequences of wild-type 
(Ref.), Z, PB and Rev alleles. Amino acid position 342 (blue), recognition sites 
for ZFNs (green) and piggyBac excision site (red) are shown. Sequence changes 
in Rev allele from Z allele are indicated by asterisks. c, Surveyor nuclease assay 
showing the cleavage of the Z mutation in ZFN-transfected K562 cells. Non- 
transfected cells were used as a control. NHEJ, non-homologous end joining. 
d, Southern blot analysis showing biallelic piggyBac insertion (B-16) and 
biallelic excision (B-16-C2, -C3 and -C6) during correction of the ALATD- 
iPSC line B. Genomic DNA was digested by BamHI (5’ and PB probes) or AIflII 
(3' probe). Genotype: PP, homozygous for insertion of piggyBac; RR, 
homozygous for reverted allele; ZZ, homozygous for Z allele. e, Sequence 
analysis showing correction of the Z mutation in three corrected iPSC lines. 
Wild-type sequence (top line) and A1ATD-iPSC sequence (second line). 


amplified region in human ES cells*”* (see Supplementary Analysis 
and Supplementary Fig. 4). Line A retained a normal genome content 
compared to its parental fibroblast. Reassuringly, we found that after 
ZFN-stimulated targeting, four out of six homozygous clones had 
unaltered genomes compared to their parental iPSC lines. Sixteen cell 


Table 1| Summary of PCR genotyping of ZFN-stimulated gene 
targeting 


A1ATD- Clones analysed Het.* | Homo./ Het. + Homo./Hemi.+ — Non- 
iPSC line Hemi. additional additional targeted§ 
integrationst integrations 
A 84 45 3 23 8 5 
B 18 10 2 3 3 0) 
Cll 216 112 9 52 21 22 
ean frequency (%) 54 6 23 12 5 


* Het., clones heterozygous for PB allele. 

Homo./Hemi., clones homozygous or hemizygous for PB allele. Cells with one targeted allele and 
deletion of the other allele are indistinguishable from correctly targeted homozygous clones by PCR. 
Such cells are designated as hemizygotes. 

{Vector backbone integration was analysed by PCR. 

§ Clones showing incorrect PCR bands are included. 

|Sum of two independent experiments. 
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Table 2 | Frequencies of biallelic piggyBac excision 


Biallelic excision without 
re-integration 


Biallelic excision with 
re-integration 


Cell line Clones Number of Frequency (%) Number of Frequency (%) 
analysed clones clones 

B-16 88 15 17 33 38 

C-G4 94 5 5 19 20 

Mean frequency (%) 11 29 


lines with biallelic piggyBac excision were compared with their corres- 
ponding primary iPSCs and 12 had unaltered genomes. We also ana- 
lysed the iPSC lines by SNP arrays to check for loss of heterozygosity 
and found that all lines analysed retained heterozygosity throughout 
their genome (Supplementary Fig. 5). This observation demonstrates 
that biallelic gene correction was the result of simultaneous homolog- 
ous recombination followed by simultaneous excision at both alleles 
and that mitotic recombination was not involved in this process. 

ZEN off-target cleavage and imprecise excision after multiple piggyBac 
transposition might introduce mutations into the genome. To investigate 
these possibilities at a single base-pair resolution, we sequenced exomes 
of the corrected B-16-C2 line and its parental fibroblast. Comparison of 
these exomes identified 29 mutations (Supplementary Table 3). The 
genesis of these mutations was determined by analysis of the primary 
iPSC line and the homozygously targeted intermediate. Twenty-four 
point mutations and one 1-bp deletion were detected in the primary 
iPSC line and four mutations arose during genetic correction: one 
during targeting and three during piggyBac excision. These mutations 
seemed to arise during culture as their genomic signatures were in- 
consistent with ZFN off-target sites or piggyBac integration sites 
(Supplementary Analysis). Taken together, we conclude that the com- 
bination of ZFNs with piggyBac provides a new method for rapid and 
clean correction of a point mutation in human iPSCs without affecting 
their basic characteristics. 

To confirm that the genetic correction of AlLATD-iPSCs resulted in 
the expected phenotypic correction, iPSCs were differentiated in vitro 
into hepatocyte-like cells, the main cell type affected by the disease 
A1ATD. Differentiation of the corrected lines occurred as expected, 
resulting in a near homogenous population of hepatocyte-like cells 
(Supplementary Fig. 6a—c). Remarkably, CGH analysis of differen- 
tiated cells showed that hepatic differentiation neither increases the 
number of genetic abnormalities nor selects for cells with abnormal 
karyotype (Supplementary Table 2d). The resulting cells shared key 
functional attributes of their in vivo counterparts including glycogen 
storage, low density lipoprotein (LDL)-cholesterol uptake, albumin 
secretion and cytochrome P450 activity (Supplementary Fig. 6d-g). 
Importantly, immunofluorescence and enzyme-linked immuno- 
sorbent assay (ELISA) both confirmed the absence of mutant polymeric 
AIAT in corrected iPSC-derived hepatocyte-like cells that instead effi- 
ciently secreted normal endoglycosidase-H-insensitive monomeric 
AIAT (Fig. 3a-d). In addition, secreted AIAT showed an enzymatic 
inhibitory activity that was comparable to that obtained from normal 
adult hepatocytes (Fig. 3e), thereby suggesting that physiological res- 
toration of enzyme inhibitory activity could be achieved. 

Lastly, the in vivo function of corrected iPSC-derived hepatocyte- 
like cells (B-C16-2 line) was assessed following transplantation into 
the liver of Alb-uPA*’*;Rag2 ‘~ ;Il2rg ‘~ mice via intrasplenic injec- 
tion. Livers harvested 14 days after injection were colonized by human 
cells identified using antibodies specific to human albumin and 
AIAT (Fig. 3f, g). These human hepatocyte-like cells were distributed 
throughout the liver lobes and were seen to be integrated into the 
existing mouse parenchyma (Fig. 3f, g). In addition, human albumin 
was detected in the serum of transplanted animals for at least 5 weeks 
(Fig. 3h), whereas no tumour formation was detected in any mice. 
Therefore, corrected iPSC-derived hepatocyte-like cells were able to 
colonize the liver in vivo and show functional activities characteristic of 
their human ES-cell-derived counterparts”. Collectively these analyses 
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Figure 3 | Functional analysis of restored A1AT in corrected iPSC-derived 
hepatocyte-like cells. a, Immunofluorescence showing the absence of 
polymeric A1AT protein in hepatocyte-like cells generated from corrected 
iPSCs. All forms of A1AT (left panels) and misfolded polymeric AIAT (middle 
panels) are shown. b, c, ELISA to assess the intracellular (b) and secreted 

(c) levels of polymeric A1AT protein in hepatocyte-like cells derived from 
AIATD-iPSCs (ZZ), corrected iPSCs (RR) and control human iPSCs (+ +). 
NS, not significant. d, Endoglycosidase H (E) and peptide:N-glycosidase (P) 
digestion of ALAT immunoprecipitated from uncorrected (ZZ), corrected (RR) 
and control (++) human iPSC-derived hepatocyte-like cells (upper panels) 
and corresponding culture medium (lower panels). e, Chymotrypsin ELISA 
showing that corrected cells (RR) have A1AT enzymatic inhibitory activity that 
is superior to uncorrected cells (ZZ) and close to adult hepatocytes. 

f, g, Immunofluorescence of transplanted liver sections detecting human 
albumin (f) and A1AT (g). DNA was counterstained with 4’,6-diamidino-2- 
phenylindole (DAPI). h, ELISA read-out of human albumin in the mouse 
serum longitudinally followed for each mouse. Asterisk, the mouse was 
subjected to histology analysis. Scale bars, 100 tm. Data in b, cand e are shown 
as mean + s.d. (n = 3). Student’s t-test was performed. 
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demonstrate that genetic correction of the Z mutation resulted in 
functional restoration of A1AT in patient-derived cells. 

All the experimental evidence described earlier strongly supports 
the applicability of genetic correction in patient-specific iPSCs for cell- 
based therapy of A1ATD. We therefore repeated the genetic correction 
in more clinically relevant cells using patient-specific iPSCs repro- 
grammed from fibroblasts with Sendai virus vectors, an integration- 
free method” (Supplementary Fig. 7a-f). One primary human iPSC 
line with an intact genome by CGH analysis (Supplementary Fig. 7e 
and Supplementary Table 4) was corrected by the method described 
earlier. The final product, iPSC-3-G5-A7, had the corrected AIAT, 
an intact genome compared to the parental fibroblast and expressed 
normal A1AT protein when differentiated to hepatocyte-like cells 
(Supplementary Fig. 8 and Supplementary Table 4). This is the first 
demonstration, to our knowledge, of the generation of mutation- 
corrected patient-specific iPSCs, which could realize the therapeutic 
promise of human iPSCs. 
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Here we demonstrate that ZFNs and piggyBac transposons enable 
simultaneous biallelic correction of diseased human iPSCs. No residual 
ectopic sequences remain at the site of correction and the genome seems 
to be undisturbed elsewhere. Although we could readily obtain cell lines 
without large genomic alterations during genetic modification, the 
resulting corrected human iPSCs carry 29 mutations in protein-coding 
exons, of which 22 were non-synonymous or splice site mutations. The 
probable impact of this mutation load needs to be considered in the 
context of the likely functional impact of the mutations, taking into 
account the normal germline load, accumulated somatic variation, the 
presence of compensating normal gene copies and the requirement for 
the gene product in the derived differentiated cells. From this point of 
view, only eight mutations might affect gene functions in hepatocyte- 
like cells (Supplementary Table 3). Nevertheless, the corrected iPSCs 
could efficiently differentiate to hepatocyte-like cells and engraft into 
the animal model for liver injury without tumour formation. Therefore, 
limited genomic abnormalities might have restricted biological con- 
sequences. Careful screening of primary and corrected human iPSCs 
using deep sequencing analyses would contribute to the safe use of 
human iPSCs in clinical applications. 

iPSCs derived from different patients were effectively corrected, 
demonstrating that this method could be applied to a large number 
of A1ATD-iPSC lines. Because the biallelic correction could be carried 
out in less than 4 months, our approach may be compatible with large- 
scale production of corrected patient-specific iPSCs not only for 
AI1ATD but also for other monogenic disorders. 


METHODS SUMMARY 


A1ATD-iPSCs were described previously'®. 210° human iPSCs were co- 
transfected with ZFN expression vectors and the donor template, and subjected 
to puromycin selection (1 jig ml‘) initiated 4 days after transfection. For trans- 
poson excision, targeted cells were transfected with pCMV-hyPBase’, cultured for 
4 days, re-plated and selected in 250nM FIAU. To increase clonogenicity, cells 
were treated with ROCK inhibitor’, Y-27632 (10 uM) 4h before dissociation and 
24h after plating. Resulting colonies were picked 2 weeks later, analysed by PCR 
and further verified by Southern blot analysis. Primer sequences are listed in 
Supplementary Table 5. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Plasmid construction. Gateway-adapted piggyBac transposon vectors: a destina- 
tion vector pPB-R1R2-NP was constructed as follows. The atfR1 and attR2 sites 
were PCR-generated and digested by Nhel/HindIII and Xhol/Spel, respectively. 
EM7-neo was PCR-generated and digested by HindIII/Xhol. These three frag- 
ments were then cloned into the Nhel-Spel site of pPB-LRS (ref. 27), resulting 
in pPB-R1R2-Neo. An EcoRI-Xbal fragment containing PheS was excised from 
pR6K-R1R2-ZP”**, blunt-ended and cloned into the blunted Xhol site of pPB- 
RIR2-Neo, resulting in pPB-R1R2-NP. An entry vector pENTR-PGKpurodAtk 
was constructed by cloning a KpnI-Notl PGK-puroAtk fragment into the Kpnl- 
Not! site of pENTR-2B. 

A targeting vector for Tyr: the targeting vector was constructed using BAC 
recombineering. A bacterial artificial chromosome (BAC) clone RP24-221M7 
was introduced into Escherichia coli strain EL350 (ref. 29). A mini targeting vector 
was first constructed to modify the Tyr gene on the BAC. Left and right homology 
arms were PCR-generated and digested by AscI/BsiW1 and Nsil/Pacl, respectively. 
The transposon fragment was excised from pPB-R1R2-NP by Nsil/BsiW]1 diges- 
tion. These three fragments were then cloned into the AscI/Pacl site of pMCS, 
resulting in pMCS-Tyr-NP. An AscI-Pacl fragment was excised from pMSC-Tyr- 
NP and used for BAC targeting. A retrieving vector was constructed by cloning 
PCR-generated left and right homology arms into the Xhol/Ascl site of pMSC- 
DTA, following AscI/HindIII and Xhol/HindIII digestion of the left and right arm, 
respectively. The retrieving vector was linearized by HindIII digestion and used to 
retrieve the 3.0-kb 5’ arm, the transposon and the 6.5-kb 3’ arm. Lastly, the Neo- 
PheS cassette was replaced with the PGK-puroAtk cassette by Gateway cloning, 
resulting in pDTA-Tyr’®. The targeting vector was linearized by AscI before 
electroporation into the albino miPSCs. 

A donor template vector for A1AT: a 2-kb fragment, which contained 1 kb on 
both sides of the Z mutation, was first PCR-amplified using genomic DNA from 
ALATD-iPSC line B asa template and cloned into pCR4-blunt-TOPO (Invitrogen), 
resulting in pCR4-AAT_Z. To construct a donor template with corrected sequence 
and a piggyBac transposon, the 5’ arm and 3’ arm were PCR-amplified and digested 
with AscI/Nsil and BsiWI/Pacl, respectively. The Nsil-BsiWI fragment containing 
a piggyBac transposon with the Neo-PheS cassette was excised from pPB-R1R2-NP. 
The digested fragments were cloned into the AscI-Pacl site of pMCS, resulting in 
pMCS-AAT-PB:NP. The Neo-PheS cassette was subsequently replaced with a PGK- 
puroAtk cassette by Gateway cloning, resulting in the final donor vector, pMCS- 
AAT-PB:PGKpuroAtk. 

The plasmids (pPB-RIR2-NP, pENTR-PGKpuroAtk, pMCS-AAT- 
PB:PGKpuroAtk) have been deposited in the Wellcome Trust Sanger Institute 
Archives and are available upon request (http://www.sanger.ac.uk/technology/ 
clonerequests/). 

Cell culture. Appropriate ethical approval and patient consent were obtained 
(Ethics reference no. 08/H0311/201; R&D no. A091485). ALATD-iPSCs (ref. 16; 
A, patient 2 line 1; B, patient 1 line1; C, patient 3 line 1) were cultured on mouse 
embryonic fibroblast (MEF)-feeder layers in human ES cell medium: DMEM/F12 
supplemented with 20% knockout serum replacement, 1 mM GlutaMax, 0.1 mM 
2-mercaptoethanol, 1X non-essential amino acids and 4 ng ml ' FGF2 (Invitrogen). 
Subculture was performed every 5-7 days by detaching human iPSCs by incubation 
in 0.5 mg ml ' dispase and 0.5 mg ml collagenase type IV for 1 h at 37 °C, collect- 
ing detached human iPSC colonies, breaking down into small clamps and plating 
them onto new feeder plates. MEFs (CF1 or B6129F1) were cultured in DMEM 
containing 10% FCS, 2mM glutamine, 0.1 mM 2-melcaptoethanol and 1X non- 
essential amino acids. Mouse iPSCs (iPS25A1; ref. 15) were cultured on MEF-feeder 
layers in mouse ES cell medium: KO-DMEM supplemented with 15% FBS, 1 mM 
GlutaMax, 0.1 mM 2-mercaptoethanol, 1 non-essential amino acids and 1,000 unit 
ml ! LIF (Millipore). 

Gene targeting and transposon excision in mouse iPSCs. 1 X 10’ cells were 
electroporated with 25 \g of a linearized targeting vector in 800 ul of HEPES- 
buffered saline using a Gene Pulser II electroporator (230 V, 500 LF) and plated 
onto three 10-cm dishes. The next day, puromycin selection (1 1gml~') was 
initiated. Resulting colonies were picked and screened by PCR. Targeted clones 
were expanded and further verified by Southern blot analysis. Correctly targeted 
clones were then subjected to transposon excision. 2 X 10° cells were electropo- 
rated with 40 jig of pCMV-hyPBase in 800 ul of HEPES-buffered saline using a 
Gene Pulser II electroporator (230 V, 500 LF) and plated onto one well of a 6-well 
plate. After passage once, cells were replated on day 4 at 5 X 10° cells per 10-cm 
dish. On the following day, FIAU (0.2 1M) selection was initiated. On day 5 of 
selection, FIAU was withdrawn. Resulting colonies were picked at day 7 and 
screened by PCR. Primer sequences to detect homologous recombination are 
shown in Supplementary Table 5. 

ZFN-mediated gene targeting in ALATD-iPSCs. On the day of electropolation 
(day 0), near-confluent cells were pre-treated with a ROCK inhibitor” (Y-27632, 
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Sigma) at 10 1M for 3-4 h before electroporation. Cells were then washed with PBS 
once, detached by Accutase (Millipore; 10 min at 37 °C) and mixed with DMEM/ 
F12 containing 10% FCS. Cells were dissociated into single-cell suspension by 
vigorous pipetting and counted. 2 X 10° cells were pelleted and mixed with 5 1g 
of a 5'-ZEN expression vector, 5 Ug of a 3’-ZEN expression vector and 2 1g of the 
donor template in 100 tl of human ES cell solution 1 (Lonza). The cell suspension 
was transferred to a cuvette and electroporated using the Amaxa Nucleofector 
device (Lonza) with program A23. The electroporated cells were plated onto one 
or two 10-cm feeder dishes in MEF-conditioned human ES cell medium contain- 
ing 10 uM Y-27632. Human ES cell medium without any drug was used for daily 
medium change between days 1-3. On day 4, puromycin selection (1 ug ml ') was 
started. On day 6, medium was changed to MEF-conditioned human ES cell 
medium containing 0.5 pg ml_' puromycin, which was used for medium change 
at every other day until picking colonies. Resulting colonies were picked on day 
13-17. Colonies were cut into two pieces. One half was transferred onto one well of 
a 24-well feeder plate and the other half was lysed and used for PCR genotyping. 
PCR-positive clones were further expanded and homologous recombination was 
verified by Southern blot analysis. 

Transposon excision in homozygously targeted human iPSCs. Homozygously 
targeted clones (B-16, C-G4, SeV-1-C3 and SeV-3-G5) were used for transposon 
removal. Line-A-derived clones were omitted because this line showed a lower 
capability of differentiating into endodermal lineages. Cells were prepared as 
described earlier. 2 10° cells were mixed with 10,1g of the hyperactive 
piggyBac transposase expression vector (pCMV-hyPBase*) in 100 ul of human 
ES cell solution 1 and electroporated using the Nucleofector device with the 
program A23. Electroporated cells were plated onto a 6-well plate in 1:2, 1:3 
and 1:6 dilutions in MEF-conditioned human ES cell medium containing 
10 uM Y-27632. Note that ROCK inhibitor was added to the culture medium until 
day 6 in this experiment. On day 2, cells with ~80% confluency were passaged 
using Accutase at a split ratio of 1:2, 1:3 and 1:6 into 6-well plates. On day 4, cells 
with ~80% confluency were washed with PBS, detached with Accutase, suspended 
in human ES cell medium and pelleted. Cells were resuspended in human ES cell 
medium into single-cell level and counted. 1 X 10* cells were then plated onto one 
10-cm dish in human ES cell medium containing 10 1M Y-27632. 16-18h after 
plating (day 5), medium was changed to human ES cell medium containing 
0.25 uM FIAU and 101M Y-27632. On day 6, medium was changed to human 
ES cell medium containing 0.25 uM FIAU and then medium was changed every 
other day. Genotype and deletion of the piggyBac transposon were analysed by 
PCR and further verified by Southern blot analysis. 

CGH analysis. Genomic DNA was extracted using a DNeasy kit (Qiagen). Agilent 
244K human genome arrays were used following the manufacturer’s protocol. The 
arrays were scanned with an Agilent microarray scanner and data were generated 
by Agilent Feature Extraction software. CGH calls were made with Agilent’s DNA 
analytics software using the ADM2 algorithm (6.0 threshold) with a minimum of 5 
probes in the region as a filter. 

SNP analysis. An Illumina HumanCytoSNP-12 SNP array was used following the 
manufacturer’s protocol. Genotype calls were performed by Illumina’s GenomeStudio. 
B allele frequency and log R ratio were analysed by KaryoStudio. CNVpatition 
v2.4.4 bundled in KaryoStudio was used for copy number analysis. 

ZEN design. ZENs were designed against a region containing the Z mutation in 
the A1AT gene (see Fig. 2a, b) and assembled as previously described”. The amino 
acid residues at positions ‘— 1’ to ‘+6’ of the recognition o-helix’'”’ of each of the 
zinc finger DNA-binding domains for each DNA triplet target are shown in 
Supplementary Table 2. The ZFNs were linked to the wild-type Fokl catalytic 
domain. The activity of the ZFN at the endogenous target site was determined 
using the Surveyor Nuclease assay as previously described”’. 

Human iPSC-derived hepatocyte-like cell transplantation. All mice were 
housed in pathogen-free conditions and animal studies were approved by the com- 
mittee on animal experimentation of the Institut Pasteur and by the French Ministry 
of Agriculture. Differentiated cells (5 x 10° cells per animal in 50 pl DMEM) were 
injected into the spleens of 3- to 4-week-old Alb-uPA*’ *;Rag2 ‘Ire mice 
(n = 7). The recipient mouse was killed 2 weeks after transplantation for histological 
analysis. Blood samples were collected and human albumin in plasma was quan- 
tified by ELISA (Bethy Laboratories). Frozen liver sections were analysed by 
immunofluorecence with human albumin (Dako) or human A1AT (Dako) spe- 
cific antibodies. Non-transplanted mice were used as controls. 

Exome sequencing. The corrected iPSC line, B-16-C2, and its parental fibroblasts 
were analysed. Exome sequencing and analysis were performed as described previ- 
ously™* with minor modifications. Exome pull-down was performed using an 
Agilent SureSelect Human All Exon 50Mb Kit according to the manufacturer’s 
instructions. Enriched DNA was sequenced on an Illumina HiSeq 2000 
(75-bp paired-end sequencing). 90.32% (Fibroblast-B) and 90.72% (B-16-C2) 
of total targeted regions were covered with more than 10 sequencing depth, 
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covering 93.01% and 93.35% of CCDS exons, respectively. Substitutions in the 
coding sequence were called as positions with at least 20% of reads reporting a 
different base with respect the reference human sequence (GRCh37). Additionally, 
somatic mutations were identified by comparing the sequence with the control 
fibroblasts, and removing the common polymorphisms described in dbSNP and in 
the 1000 Genomes Project**. Small insertions and deletions were identified using 
samtools, as the ones not present in the control cell line and that had at least 20 of 
coverage and 20% of the reads reporting the mutation. Validation of mutations 
was carried out by Sanger capillary sequencing on parental Fibroblast-B, 
A1ATD-iPSC line B, the homozygously targeted B-16 cells and the piggyBac- 
excised B-16-C2 cells. 

Other experimental procedures. Sendai virus regrogramming, RT-PCR, quanti- 
tative RT-PCR, bisulphite sequencing, immunostaining, flow cytometric analysis, 
ELISA and EndoH analysis were performed as described previously'®**”*. 
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Metabolic priming by a secreted fungal effector 
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Maize smut caused by the fungus Ustilago maydis is a widespread 
disease characterized by the development of large plant tumours. 
U. maydis is a biotrophic pathogen that requires living plant tissue 
for its development and establishes an intimate interaction zone 
between fungal hyphae and the plant plasma membrane. U. maydis 
actively suppresses plant defence responses by secreted protein 
effectors’’. Its effector repertoire comprises at least 386 genes 
mostly encoding proteins of unknown function’** and expressed 
exclusively during the biotrophic stage’. The U. maydis secretome 
also contains about 150 proteins with probable roles in fungal nutri- 
tion, fungal cell wall modification and host penetration as well as 
proteins unlikely to act in the fungal-host interface* like a choris- 
mate mutase. Chorismate mutases are key enzymes of the shikimate 
pathway and catalyse the conversion of chorismate to prephenate, 
the precursor for tyrosine and phenylalanine synthesis. Root-knot 
nematodes inject a secreted chorismate mutase into plant cells likely 
to affect development**. Here we show that the chorismate mutase 
Cmul1 secreted by U. maydis is a virulence factor. The enzyme is 
taken up by plant cells, can spread to neighbouring cells and changes 
the metabolic status of these cells through metabolic priming. 
Secreted chorismate mutases are found in many plant-associated 
microbes and might serve as general tools for host manipulation. 

The U. maydis genome (http://mips.helmholtz-muenchen.de/genre/ 
proj/ustilago) contains genes for both a cytosolic chorismate mutase, 
designated aro7 (um04220), and a putatively secreted chorismate 
mutase, cmul (um05731). Cmul belongs to the AroQ class of 
eukaryotic chorismate mutases (Interpro: IPR008238) that have an 
all-o.-helical secondary structure (Supplementary Fig. 1)”*. To verify 
that Cmul is a dedicated chorismate mutase, we demonstrated that it 
complemented a Saccharomyces cerevisiae aro7 mutant (Fig. 1a) and 
that heterologously expressed protein had chorismate mutase activity 
which was not feedback inhibited by aromatic amino acids (Sup- 
plementary Fig. 2). Allosteric regulation is a characteristic feature 
of plastidic chorismate mutases as well as of cytoplasmic fungal 
chorismate mutases”'®, whereas cytosolic plant chorismate mutases 
lack this feature''. Attempts to generate a cmu1 mutant that displayed 
allosteric regulation based on features of S. cerevisiae Aro7p were 
unsuccessful (Supplementary Fig. 3). Western blot analysis detected 
Cmul-haemagglutinin (HA) in U. maydis culture supernatants when 
the respective fusion gene was expressed under a constitutive promoter 
in hyphal cells (Supplementary Fig. 4). The secretion of Cmu1 during 
plant colonization was independently demonstrated by proteome 
analysis of apoplastic fluids isolated after infection of maize with a 
mixture of compatible U. maydis strains. Compared with known 
secreted effector proteins like Mig2 (ref. 12), higher numbers of 
Cmul peptides were identified at all time points analysed (Sup- 
plementary Information, Table 1). 


Like many other U. maydis effectors with a virulence function’*”, 


cmul is specifically upregulated during biotrophic development 
(Supplementary Fig. 5) and is one of the most highly expressed fungal 
genes during plant colonization’. To determine a possible contri- 
bution to virulence cmu1 was deleted in the solopathogenic strains 
$G200 (ref. 3) and CL13. CL13 is the progenitor strain of SG200 that 
shows attenuated virulence’® (see Supplementary Fig. 6a for disease 
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Figure 1 | Cmu1 has chorismate mutase activity, affects virulence and 
salicylic acid levels. a, U. maydis cmu1 complements the aro7 deletion in S. 
cerevisiae. Growth of the S. cerevisiae aro7 deletion mutant Y05479 on medium 
lacking tyrosine and phenylalanine (SD-phe-tyr) is restored by introduction of 
U. maydis cmu1; whereas cmu1p1834,K193a does not complement. Expression of 
cmul genes was driven by the GALI promoter. S. cerevisiae Y00000 (native 
ARO7 gene) was used as positive control. YEPD, rich medium; gal, galactose. 
b, Deletion of cmu1 negatively affects virulence of U. maydis strain CL13. 
Disease symptoms (as depicted in Supplementary Fig. 6a) on maize plants were 
scored? 12 days after infection with the indicated strains. Mean values of seven 
independent infections are shown with the total number of infected plants 
indicated above each column. Compared with CL13 and CL13Acmul-cmul- 
HA, CL13Acmu1 showed significantly reduced tumour formation (t-test, 

P = 0.037). ¢, Total amounts of salicylic acid were determined in plant leaves 
infected with the indicated U. maydis strains listed below 8 days after infection. 
For the infections with CL13Acmul, three independent strains were used. 
Mean values of three independent experiments are shown. Error bars, s.d. 
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symptoms of CL13) and hence facilitates the detection of modest 
differences in virulence’®. Whereas SG200Acmul strains showed little 
virulence attenuation (Supplementary Fig. 6b), the CL13Acmul 
mutant displayed a reduction of about 50% in tumours, which could 
be complemented by introducing a single copy of cmul-HA (Fig. 1b). 
This illustrates that Cmul is required for full virulence and demon- 
strates functionality of the HA-tagged protein. 

To localize Cmul during biotrophic growth, plants were infected 
with SG200Acmul-cmul-HA, which carries a cmu1-HA fusion gene 
inserted in single copy under control of its native promoter. Plants 
infected with SG200 or with SG200 P...u4;GFP-HA expressing cyto- 
plasmic green fluorescent protein (GFP) under the cmu1 promoter 
served as negative controls. Freeze-substituted and resin-embedded 
sections of maize tissue harvested 3 days after infection with these 
strains were incubated with anti-HA antibodies and gold markers. 
Cmul-HA could be detected inside the fungal hyphae, in the bio- 
trophic interface as well as inside the plant cytoplasm but rarely in 
the plant cell wall (Fig. 2A and Supplementary Fig. 7). The distribution 
of gold particles was quantified (Fig. 2B). Gold labelling of plant tissue 
infected with the parental strain SG200 was negligible (Supplemen- 
tary Fig. 8), whereas non-secreted GFP-HA was absent from the bio- 
trophic interphase, showed strong accumulation in the fungal cytosol 
and weak background labelling in the plant cytosol (Supplemen- 
tary Fig. 9 and Fig. 2B). Integrity of Cmul-HA was demonstrated 
by western blot analysis after immunoprecipitation from infected 
plant tissue (Supplementary Fig. 10). To demonstrate Cmul local- 
ization independently, plants were infected with SG200Acmul- 
cmul-mCherry-HA. Cmul-mCherry-HA was detected in the 
biotrophic interface, and plasmolysis experiments showed that it freely 
diffused in the enlarged apoplast (Supplementary Fig. 11). However, 
fluorescence could not be detected inside plant cells. In addition, 
Cmul1-mCherry—HA was unable to complement the virulence pheno- 
type of CL13Acmul (Supplementary Fig. 12a) despite the fact that 
the fusion protein was enzymatically active as demonstrated by 
complementation of the aro7 yeast mutant (Supplementary Fig. 13). 
Cmul2-290>-HA lacking the secretion signal was unable to 
complement the virulence phenotype of CL13Acmul (Supplemen- 
tary Fig. 12b), demonstrating that secretion is prerequisite for func- 
tion. In sum, these data suggest that Cmu1 needs to enter plants cells to 
exert its function and that Cmul-mCherry-HA is unable to do so. 

To elucidate the subcellular localization of Cmul1 in plant cells, a 
Cmu12-299-mCherry fusion protein lacking the signal peptide and 
active in complementing the yeast aro7 mutant was transiently 
expressed in maize leaves (Supplementary Fig. 13). Cmul 2_299- 
mCherry localized to the cytoplasm and the nucleus of transformed 
maize cells (Fig. 2C). Surprisingly, in some cases the Cmul22_290- 
mCherry signal was also visible in cells adjacent to the originally 
transformed cell (Fig. 2C, a). To rule out that the latter is caused by 
independent transformation events, Cmu1 2 399-yellow fluorescent 
protein (YFP) and PIP42¢6_593-mCherry encoding a fusion protein that 
localizes exclusively to the nucleus, were co-expressed (Supplemen- 
tary Fig. 14). Cell-to-cell spreading of Cmu1  _99-YFP was observed 
in some cases whereas PIP496-593-mCherry always remained in the 
nucleus of the originally transformed cell (Supplementary Fig. 14). 
Occasionally guard cells were transformed, and in such cases spread- 
ing of Cmul 7 299-YFP was never observed (Fig. 2C, c and Sup- 
plementary Fig. 14). Because guard cells lack plasmodesmata”, 
the observed spreading of Cmul 2-299 is likely to occur through 
plasmodesmata. 

By yeast two-hybrid analysis we demonstrated that Cmul can 
dimerize (Supplementary Fig. 15a), a property characteristic for 
AroQ chorismate mutases*. In addition, Cmul could interact with 
the two maize chorismate mutases ZmCm1 (B6TU00) and ZmCm2 
(B4FUP5) (Supplementary Fig. 15a). Despite low overall sequence 
conservation, known residues essential for chorismate mutase activity 
were conserved in all these enzymes (Supplementary Fig. 15b). 
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Figure 2 | Cmu1 is translocated to plant cells and spreads to neighbouring 
tissue. A, A maize section infected with U. maydis SG200Acmul-cmul-HA 
was collected 3 days after infection, probed with mouse anti-HA antibodies and 
detected with anti-mouse antibodies conjugated to 12-nm gold particles 
(Methods). Electron micrographs visualize Cmu1-HA inside fungal hyphae, in 
the biotrophic interface and in the cytoplasm of infected maize cells. Leaves 
infected with $G200 and $G200 P-muiGFP-HA served as negative controls 
(Supplementary Figs 8 and 9). fc, fungal cytosol; few, fungal cell wall; bi, 
biotrophic interphase; pc, plant cytosol; pew, plant cell wall; ppm, plant plasma 
membrane. Scale bars, 1 tm. B, Electron micrographs of immunogold-labelled 
sections were analysed for the spatial distribution of gold labels in 
S$G200Acmul-cmul-HA (blue) and $G200 PonyjGFP-HA (red) infected 
tissue 3 days after infection. The total number of gold labels in each electron 
micrograph was set to 100% (see Methods for details). Error bars, s.d. of gold 
particles counted in three independent cross-sections. C, Confocal Z-stacks 
visualize spreading of Cmul,,_299-mCherry to neighbouring tissue after 
biolistic transformation of maize leaves. White arrows indicate the originally 
transformed maize cells that carry a gold particle in their nucleus (a- 

c). Spreading of the fluorescent signal was observed in some cases for Cmul- 
mCherry (a) and not in others (b, c). Yellow arrows mark Cmu122_299- 
mCherry signals in nuclei of neighbouring cells (a). Cmu1 _399—mCherry 
spreading was never detected in transformed guard cells (c). Scale bars, 40 jim. 
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ZmCml encodes a predicted plastidic isoform (Supplementary Fig. 16a) 
whereas ZmCm2 codes for a putative cytoplasmic enzyme (Sup- 
plementary Fig. 15b). Localization to the respective compartments was 
demonstrated by transient expression in maize leaves (Supplementary 
Fig. 16b, c). The observed compartmentalization mimics what has been 
described for the chorismate mutases of Arabidopsis thaliana’’. 
Furthermore, as shown for the cytoplasmic isoform of chorismate 
mutase in A. thaliana’, ZmCm2 displayed enzymatic activity but no 
allosteric regulation in vitro (Supplementary Fig. 15c). 

To demonstrate that the interaction between Cmul and the cytosolic 
maize chorismate mutase ZmCm2 can have functional consequences, 
we attempted to show that Cmul could alter ZmCM2 activity 
in vitro. As this was unsuccessful (Supplementary Fig. 15d), we next 
generated a loss of function allele of cmu1 based on catalytically 
inactive forms of S. cerevisiae Aro7p (Supplementary Fig. 1a)”°”’. 
We reasoned that heterodimer formation between active and inactive 
monomers might interfere with chorismate mutase function. As 
expected, the cmulpig3a,.xi944 allele was unable to complement 
the aro7 deletion in S. cerevisiae (Fig. la). Surprisingly, when 
cmulpig3a,Ki94a Was introduced in single copy in either CL13Acmul 
or SG200Acmu1, virulence was completely abolished (Supplementary 
Fig. 17). When cmu1pis34,K1944 Was introduced in SG200 harbouring 
a functional cmu1 allele, the mutated allele had a dominant effect that 
was copy number dependent (Supplementary Fig. 17). Confocal 
microscopy of infected leaf tissue revealed that the SG200Acmul- 
cmulgig3a.Ki94a Strain could form appressoria (Supplementary Fig. 18a) 
but failed to colonize the plant (Supplementary Fig. 18b). In con- 
trast to SG200 infections, plant cells infected with SG200Acmul- 
cmulgig3a,xi9aa Were heavily stained by propidium iodide and 
displayed strong autofluorescence, probably because of the formation 
of phenolic compounds (Supplementary Fig. 18). This indicates that 
SG200Acmul1-cmulgig34,K1944 elicits a strong plant defence response. 

To exclude the possibility that the non-functional secreted chorismate 
mutase might interfere with the endogenous fungal shikimate pathway, 
we generated SG200Acmul derivatives that express cmu1pi34,K194A 
under control of a strong constitutive promoter (SG200Acmul- 
PotercmUlRig3a,Ki94a-HA). Western blot analysis confirmed that the 
mutant protein was produced and secreted (Supplementary Fig. 19a). 
These strains did not show a growth phenotype on minimal media 
lacking aromatic amino acids, were morphologically indistinguishable 
from SG200 during growth in minimal media, and were unaltered in 
filamentous growth on charcoal media, a prerequisite for successful 
infection (Supplementary Fig. 19b-e). This illustrates that the secreted 
Cmulgig3a,K194a—HA protein does not interfere with the activity of the 
cytoplasmic Aro7 protein in U. maydis. Cmulgig34Ki194a-mCherry- 
HA accumulated around biotrophic hyphae like other secreted effec- 
tors’ (Supplementary Fig. 20b) but was unable to cause a domi- 
nant negative virulence phenotype when expressed in SG200Acmu1 
(Supplementary Fig. 20a). This suggests that plant uptake of 
Cmul1pis3a,K104a-mCherry—-HA is necessary for the dominant effect 
on virulence, presumably because activity of ZmCM2 is affected. To 
obtain evidence that Cmulgig3axi9aa-HA can reduce ZmCm2 
activity, we first showed that ZmCm2 was able to interact with 
Cmulgig3a.Ki014a and could complement the aro7 mutation in 
S. cerevisiae (Supplementary Fig. 21). Next, zmcm2 was co-expressed 
with cmul or cmulgjs3a,xi9aa in the yeast aro7 mutant strain. 
Although the co-expression of zmcm2 and cmu1 had no detectable 
effect on growth, co-expression of zmcm2 and cmulpig34,K1944 
attenuated growth on plates lacking phenylalanine and tyrosine (Sup- 
plementary Fig. 21b). Therefore, the dominant negative effect on viru- 
lence elicited by the cmu1p1834,K194a-HA allele is probably caused by 
interfering with the activity of cytosolic ZmCm2 through dimerization. 
This also implies that orphan cytosolic plant chorismate mutases 
might have an important regulatory function. 

To obtain a comprehensive view on the metabolic changes in plants 
infected with CL13, CL13Acmul and CL13Acmul-cmul-HA, 
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metabolome analyses were conducted 8 days after infection (Sup- 
plementary Figs 22 and 23 and Supplementary Table 2). Compared 
with mock-infected maize, plants infected with CL13 showed 
enhanced levels for phenylpropanoid and lignan biosynthesis products 
as well as for benzoxazinones, which derive from tryptophan (Sup- 
plementary Fig. 22 and Supplementary Table 3). For plants infected 
with CL13 and CL13Acmul, the most notable differences concerned 
the phenylpropanoid pathway (Supplementary Fig. 22). Substances 
such as coumaroyl- and caffeoylquinate and syringine as well as lignan 
(like the syringaresinol-glucosides) were less abundant in tissue 
infected with CL13Acmul than in plants infected with either CL13 
or the complemented strain CL13Acmul-cmul-HA (Supplementary 
Fig. 22b and Supplementary Tables 2 and 3). In contrast, the amount of 
salicylic acid was at least ten times higher in plants infected with 
CL13Acmul1 than those infected with the parental strain CL13 or 
CL13Acmul-cmul-HA, respectively (Fig. 1c). The amounts of the 
tryptophan-derived benzoxazinones were not significantly different 
in CL13Acmul and CL13 infections (Supplementary Fig. 22), indi- 
cating that the pathway from chorismate to tryptophan through 
anthranilate synthase is unaffected by Cmul activity. The underlying 
mechanism for this differential effect awaits further study. Our results 
support a situation in which Cmul channels chorismate into the phe- 
nylpropanoid pathway and prevents its flow into the salicylic acid 
biosynthesis branch. 

To elucidate the biological significance of the elevated salicylic acid 
levels in CL13Acmu 1 infections, maize seedlings were treated locally 
with 4mM salicylic acid before infection or co-infiltrated during the 
infection with CL13. This concentration was chosen on the basis of 
total salicylic acid levels determined in CL13Acmul1-infected plants. 
Both treatments led to a reduction in virulence comparable to 
CL13Acmu1 infections (Supplementary Fig. 24), which illustrates that 
salicylic acid enhances resistance of maize towards U. maydis. The data 
imply that the observed decrease in virulence for CL13Acmu1 could be 
a direct consequence of its inability to interfere with pathogen-induced 
salicylic acid biosynthesis of the host plant. 

Our findings provide new insights into a process that aids U. maydis 
during colonization of maize plants. It relies on the secretion of a 
chorismate mutase which enters plant cells by an unknown mech- 
anism and redirects the metabolome in favour of the parasite. 

We propose that the translocated fungal enzyme acts in conjunction 
with ZmCm2 in the plant cytosol by increasing the flow of chorismate 
from the plastid to the cytosol and in turn lowering the available 
substrate for salicylic acid biosynthesis in plastids (Fig. 3). However, 
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Figure 3 | Model of Cmul-mediated metabolic priming in infected maize 

tissue. An infecting fungal hyphae is depicted in yellow. Maize cells are shown in 
mint green, the plastid is depicted in darker green. The dotted line indicates that 
prephenate or prephenate derivatives might be re-imported into the plastid or 
have regulatory capacity to feedback on plastidic synthesis of phenylalanine and 
tyrosine or derived phenolic compounds. SA, salicylic acid. For details, see text. 
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the introduction of a deregulated chorismate mutase into the host 
plant cytosol alone cannot explain all the metabolic changes observed 
in the U. maydis infected tissue (Supplementary Table 2 and Sup- 
plementary Fig. 18)**?. Thus, in line with its modest effects on viru- 
lence, we consider Cmu1 to be one component of a cocktail of effectors 
shaping the host metabolome. In this context it might not be coincid- 
ence that U. maydis has genes for two potential salicylate hydroxylases. 
In addition, organ-specific functions as described for several other 
U. maydis effectors cannot be excluded". 

The suppression of salicylic acid levels is likely to be particularly 
important for biotrophic pathogens and symbionts”’. In line with this 
we found genes encoding secreted chorismate mutases in many genomes 
of eukaryotic biotrophic plant pathogens and symbionts and several 
hemibiotrophic plant pathogens but only rarely in necrotrophic plant 
pathogens and fungal saprophytes (Supplementary Table 4). Recent 
findings indicate that the secreted chorismate mutase in the fungus 
Sclerotinia sclerotiorum might also represent a virulence factor (M. 
Dickman, personal communication). Metabolic priming by secreted 
chorismate mutases might thus emerge as a common strategy for host 
manipulation. 


METHODS SUMMARY 


The Methods section provides detailed information about all experimental proce- 
dures, including the following: (1) tables with details on oligonucleotides, plasmids, 
U. maydis and S. cerevisiae strains used or generated in this study; (2) details on the 
cloning strategies; (3) description of U. maydis mutant generation and their sub- 
sequent analysis; (4) links to bioinformatic tools applied in this study; (5) details for 
conducting quantitative real-time PCR analyses; (6) description of the yeast com- 
plementation assays; (7) description of yeast protein interaction assays; (8) details 
for conducting chorismate mutase activity assays; (9) the method to demonstrate 
protein secretion in U. maydis; (10) details on the transient expression in Zea mays; 
(11) confocal and electron microscopy methods; (12) details for metabolome and 
hormone analyses; and (13) protocol for the isolation and mass spectrometric 
analysis of secreted proteins of apoplastic fluids. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Generation of plasmids. Standard molecular cloning strategies and techniques 
were applied in this study~*. Most of the constructs were generated using Gateway 
technology (Invitrogen) after an insertion of the gene of interest into the pEntry4 
vector (Invitrogen) using Ncol and NotI restriction sites. Primers used in this 
study are described in Supplementary Table 5. Plasmids that were generated in 
this study are listed in Supplementary Table 6. 

Mutant generation and analysis. All U. maydis strains (Supplementary Table 7) 
were generated by gene replacement with PCR-generated constructs or by inser- 
tion of p123 derivatives into the ip locus as described” (Supplementary Table 7). 
At least three independent mutants were repeatedly tested for virulence on 7-day- 
old maize seedlings and disease was scored 12 days after infection following 
described protocols’. The widely used solopathogenic haploid strains SG200 
and CL13 differ in virulence owing to the presence of autocrine pheromone 
signalling in SG200 and its absence in CL13, respectively'®. Compared with the 
naturally occurring dikaryon, both strains show reduced virulence. Typical symp- 
toms caused by CL13 are depicted in Supplementary Fig. 3a. 

Bioinformatic analyses. Signal peptide prediction was performed with the pro- 
gram SignalP 3.0 (http://www.cbs.dtu.dk/services/SignalP/). Chloroplast transit 
peptides were predicted with the program ChloroP (http://www.cbs.dtu.dk/ 
services/ChloroP/). Sequence alignments were generated using CloneManager 
Suite 9.0 (www.scied.com). Hierarchical neural network was applied for prediction 
of the Cmu1 secondary structure at the Network Protein Sequence Analysis server 
(http://npsa-pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_nn.html). Domain 
analyses were performed with Smart and InterPro (http://smart.embl-heidelberg. 
de/; http://www.ebi.ac.uk/Tools/InterProScan/). 

Quantitative real-time PCR. RNA was extracted from sporidia grown in axenic 
culture as well as from infected maize plants at the indicated time points with the 
TRIzol method (Invitrogen), treated with DNase (Ambion) and subsequently used 
for cDNA synthesis. Quantitative real-time PCR reactions were conducted as 
described earlier’. All reactions were performed at least in biological triplicates. 
Relative cmu1 expression levels were calculated in relation to the values obtained 
for the constitutively expressed peptidyl-prolyl cis—trans isomerase gene (ppi) of 
U. maydis*° (Supplementary Table 5). 

Yeast complementation assay. Yeast strain Y054679 lacking the ARO7 gene was 
transformed with the corresponding pYES and pGad derivatives (Supplementary 
Table 6) using standard protocols (Clontech) and tested for growth on medium 
lacking phenylalanine and tryptophan as described previously’’. A compilation of 
all S. cerevisiae strains used in this study is provided in Supplementary Table 8. 
Yeast protein interaction assay. The genes encoding the proteins tested for 
interaction were cloned into pGBKT7 or pGADT7 vectors (Clontech; Supplemen- 
tary Table 6), generating in-frame fusions with a gene encoding the yeast GAL4 
binding and activation domain, respectively. Interaction was tested in S. cerevisiae 
AH109 (Clontech). Growth controls were performed on selective dropout media 
(SD) plates lacking only tryptophan and leucine to select for cells containing the 
correct plasmids. Protein interactions were assayed on high-stringency SD plates 
additionally lacking adenine and histidine. 

Chorismate mutase activity assays. A glutathione S-transferase (GST)-Cmu 1 2_299- 
HA fusion protein was produced in Escherichia coli BL21 containing plasmid 
pRset-cmu2-299-HA (Supplementary Table 6) and enriched by glutathione- 
affinity purification (GE Healthcare). Also, GST-ZmCm2 fusion protein and 
derivates of Cmul for the enzyme assay were made accordingly. After removal 
of the GST moiety using PreScission protease (GE Healthcare), chorismate 
mutase activity assays were performed”. After acidic conversion of prephenate 
to phenylpyruvate the reaction was basidified and extinction at 2 = 320 nm was 
measured. The increase in extinction was plotted against time (in minutes) to 
visualize the formation of phenylpyruvate. Error bars represent s.d. from three 
technical replicates. Purified GST protein was used as a negative control and 
respective values were subtracted from those obtained with Cmul 2_299-HA. 
Demonstration of Cmul secretion. U. maydis strain AB33 P.teecmul—-HA was 
generated by insertion of plasmid p123_otef:um05731-HA into the ip locus of 
AB33** (Supplementary Table 7). To analyse Cmul-HA secretion, material was 
collected 6h after induction of filamentous growth in medium containing 
nitrate**. Protein extracts of filamentous cells and culture supernatants (after 
precipitation with trichloroacetic acid) were subjected to western blot analysis 
with mouse-anti HA (Sigma) and mouse anti-o-tubulin antibodies (Oncogene). 
Biolistic transformation of Z. mays. For biolistic transformation” of 7- to 
10-day-old maize leaves, 1.6-j1m gold particles were coated with plasmid DNA 
coding for the indicated genes driven by the CaMVS35 promoter (Supplementary 
Table 6). Bombardment was performed using a PDS-1000/He™ instrument 
(BioRad) at 900 p.s.i. in a 27 Hg vacuum. Fluorescence was observed by confocal 
microscopy 2 days after transformation. 
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Confocal and electron microscopy. Confocal microscopy was performed with a 
LeicaSP5 confocal microscope as described*’. Wheat germ agglutinin/Alexa Fluor 
488 and propidium iodide stains were performed as reported”. Autofluorescence 
was detected at 2 = 415-460 nm. 

For immunogold labelling, infected leaf parts were cryofixed by high-pressure 
freezing (Bal-Tec HPM 010), freeze-substituted in 0.5% glutaraldehyde in acetone 
(containing 2% H,0), infiltrated with Lowicryl HM20 and ultraviolet-polymerized 
at —40 °C. Ultrathin sections were labelled for HA epitope detection using mouse 
anti-HA (Sigma H9658) and donkey anti-mouse 12-nm gold antibodies (Jackson 
715-205-150) and imaged in a Philips CM10 electron microscope at 60kV. 

The distribution of gold particles was determined semi-quantitatively as 

described*’. Micrographs of sectioned Z. mays samples from infections with 
$G200 PemuiCmul-HA and $G200 Penu;GFP-HA were selected and in each case 
three different hyphae were chosen. Gold particles on each of the micrographs 
were then counted and assigned to the plant cell cytosol, the biotrophic interface or 
the fungal cytosol. The proportional distribution in these compartments was then 
calculated as a percentage and the s.d. was calculated from the three different data 
sets for each sample. 
Metabolome analyses. For metabolite fingerprinting a section of the third leaf 
between 1 and 3cm below the injection holes was excised 8 days after syringe 
infection with U. maydis strains or water (mock control), respectively. For each 
replicate, 30-40 leaf sections were pooled. Plant material was homogenized under 
liquid nitrogen. Two or three biological replicates of control leaves and infected 
leaves (80 mg each) were extracted with methyl-tert-butylether/methanol**. The 
polar phase was dried under a nitrogen stream and the extracted metabolites 
resolved in 10 ul of methanol, 10 ul acetonitrile and 120 ul water. The metabolite 
analysis was performed by ultra-performance liquid chromatography (UPLC, 
ACQUITY UPLC System, Waters Corporation) coupled with an orthogonal 
time-of-flight mass spectrometer (TOF-MS, LCT Premier, Waters Corporation). 
For LC an ACQUITY UPLC BEH SHIELD RP18 column (1mm X 100mm, 
1.7 um particle size, Waters Corporation) was used at a temperature of 40°C, a 
flow rate of 0.2 ml min’ and with the following gradient for the analysis of the 
polar phase: 0-0.5 min 10% B, 0.5-3 min from 10% B to 28% B, 3-8 min from 28% 
B to 95.5% B, 8-10 min 95.5% B and 10-14 min 10% B (solvent system A: water/ 
formic acid (100:0.1, v/v); B: acetonitrile/formic acid (100:0.1, v/v)). The TOF-MS 
was operated in negative as well as positive electrospray ionization mode in W 
optics with a mass resolution larger than 10,000. Data were acquired by MassLynx 
software (Waters Corporation) in centroided format over a mass range of m/z 85- 
1,200 with scan duration of 0.5 s and an interscan delay of 0.1 s. The capillary and 
the cone voltage were maintained at 2,700 V and 30 V and the desolvation and 
source temperature at 350 °C and 80 °C, respectively. Nitrogen was used as cone 
(301h~!) and desolvation gas (8001h~!). For accurate mass measurement, the 
TOF-MS was calibrated with phosphoric acid 0.01% (v/v) in acetonitrile/water 
(50:50, v/v) and the dynamic range enhancement mode was used for data record- 
ing. All analyses were monitored by using leucine-enkephaline ([M + H]* 
556.2771 or [M—H] 554.2615 as well as its ‘°C isotopomer [M+ H]* 
557.2803 or [M—H] 555.2615, Sigma-Aldrich) as lock spray reference com- 
pound at a concentration of 0.5 ug ml ' in acetonitrile/water (50:50, v/v) and a 
flow rate of 30 ,lmin™’. The raw mass spectrometry data of all samples were 
processed using the MarkerLynx Application Manager for MassLynx software 
(Waters Corporation), resulting in two data sets. 

The toolbox MarVis (http://marvis.gobics.de33) was used for ranking, filtering, 
adduct correcting and combining the data as well as for clustering and visualiza- 
tion, respectively. An analysis of variance test was applied to extract a subset of 
high-quality marker candidates with a p value less than 1 X 10°. The filtered data 
sets were adduct corrected according to the following rules: [M+ H]", 
[M + Na]*, [M+ NH,]* for the positive and [M— H], [M+ CH,0,— H], 
[M + CH,0, + Na— 2H] " for the negative ionization mode. The combined data 
led to an overall data set of 810 marker candidates (Supplementary Table 2), which 
were used for clustering and visualization by means of one-dimensional self- 
organizing-maps and for database search. 

The identity of selected markers was confirmed by MS’ fragment information™, 

co-elution with identical standards or exact mass measurement (Supplementary 
Table 3). 
Salicylic acid measurements. For metabolite fingerprinting, a section of the third 
leaf between 1 and 3 cm below the injection holes was excised 8 days after syringe 
infection with U. maydis strains or water (mock control), respectively. For each 
replicate, 30-40 leaf sections were pooled. 

Total salicylic acid was extracted’ and identified by co-elution with an authentic 
standard using liquid chromatography—mass spectrometry. 

Proteome analysis of apoplastic fluids. To extract apoplastic fluids, maize 
seedlings were infected with a mixture of FB1 and FB2 (ref. 36). Two, four and 
six days after infection, infected areas were excised and apoplastic fluid was 
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collected*’. After precipitation with trichloroacetic acid, proteins were separated 
by 12% SDS-polyacrylamide gel electrophoresis, digested in gel’* after subdividing 
each lane into 11 equal parts and run on an Agilent 1100 nano-HPLC system 
(75-uum C18 column, 100-min gradients), coupled to an LTQ-FT mass spectro- 
meter (Thermo Scientific). The “Top-3-SIM’ acquisition method was used, as 
described*’. Spectra were processed by MSQuant“ and searched using Mascot 
against a decoy Zea/Ustilago protein database. Mass tolerance for the precursor 
ion was in all cases 5 p.p.m, and for fragment ions 0.5 Da; full trypsin specificity 
was required and two missed cleavages were allowed. The mean measurement 
mass deviation of precursor (peptide) ions was 0.96 p.p.m. with a standard devi- 
ation of 0.82. 


24. Sambrook, J. F. E. & Maniatis, T. Molecular Cloning: A Laboratory Manual. Cold 
Spring Harbor Laboratory Press, 1200 (1989). 

25. Loubradou, G., Brachmann, A., Feldbrugge, M. & Kahmann, R. A homologue of the 
transcriptional repressor Ssn6p antagonizes cAMP signalling in Ustilago maydis. 
Mol. Microbiol. 40, 719-730 (2001). 

26. Bohlmann, R. Isolierung und Charakterisierung von filamentspezifisch exprimierten 
Genen aus Ustilago maydis. PhD thesis, Ludwig-Maximilian-Univ. Miinchen (1996). 

27. Gilchrist, D. G. C. & Conelly, J. A. Chorismate mutase from mung bean and 
sorghum. Methods Enzymol. 142, 450-463 (1987). 

28. Brachmann, A., Weinzierl, G., Kamper, J. & Kahmann, R. Identification of genes in 
the bW/bE regulatory cascade in Ustilago maydis. Mol. Microbiol. 42, 1047-1063 
(2001). 

29. Ueki, S., Lacroix, B., Krichevsky, A., Lazarowitz, S. G. & Citovsky, V. Functional 
transient genetic transformation of Arabidopsis leaves by biolistic bombardment. 
Nature Protocols 4, 71-77 (2008). 


30. 
31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


39. 


40. 


Doehlemann, G. et al. Establishment of compatibility in the Ustilago maydis/maize 
pathosystem. J. Plant Physiol. 165, 29-40 (2008). 

Shimada, Y., Ichinose, S., Sadr, A., Burrow, M. F. & Tagami, J. Localization of matrix 
metalloproteinases (MMPs-2, 8, 9 and 20) in normal and carious dentine. Aust. 
Dent. J. 54, 347-354 (2009). 

atyash, V., Liebisch, G., Kurzchalia, T. V., Shevchenko, A. & Schwudke, D. Lipid 
extraction by methyl-tert-butyl ether for high-throughput lipidomics. J. Lipid Res. 
49, 1137-1146 (2008). 

Kaever, A. et al. MarVis: a tool for clustering and visualization of metabolic 
biomarkers. BMC Bioinformatics 10, 92 (2009). 

Pommerrenig, B. et a/. Phloem-specific expression of yang cycle genes and 
identification of novel yang cycle enzymes in plantago and Arabidopsis. Plant Cell 
23, 1904-1919 (2011). 

Naranjo, M. A. et al. Lithium treatment induces a hypersensitive-like response in 
obacco. Planta 217, 417-424 (2003). 

Banuett, F. & Herskowitz, |. Different a alleles of Ustilago maydis are necessary for 
maintenance of filamentous growth but not for meiosis. Proc. Natl Acad. Sci. USA 
86, 5878-5882 (1989). 

De Wit, P. J. G. M. & Spikman, G. Evidence for the occurence of race and cultivar- 
specific elicitors of necrosis in intercellular fluids of compatible interactions of 
Cladosporium fulvum and tomato. Physiol. Plant Pathol. 21, 1-11 (1982). 
Shevchenko, A., Tomas, H., Havlis, J., Olsen, J. V. & Mann, M. In-gel digestion for 
mass spectrometric characterization of proteins and proteomes. Nature Protoc. 1, 
2856-2860 (2006). 

Olsen, J. V. & Mann, M. Improved peptide identification in proteomics by two 
consecutive stages of mass spectrometric fragmentation. Proc Natl Acad Sci USA 
101, 13417-13422 (2004). 

Mortensen, P. et al. MSQuant, an open source platform for mass spectrometry- 
based quantitative proteomics. J. Proteome Res. 9, 393-403 (2010). 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature10525 


A vascular niche and a VEGF-Nrp!1 loop regulate the 
initiation and stemness of skin tumours 
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Angiogenesis is critical during tumour initiation and malignant 
progression’. Different strategies aimed at blocking vascular 
endothelial growth factor (VEGF) and its receptors have been 
developed to inhibit angiogenesis in cancer patients”. It has become 
increasingly clear that in addition to its effect on angiogenesis, other 
mechanisms including a direct effect of VEGF on tumour cells may 
account for the efficiency of VEGF-blockade therapies’. Cancer 
stem cells (CSCs) have been described in various cancers including 
squamous tumours of the skin**. Here we use a mouse model of 
skin tumours to investigate the impact of the vascular niche and 
VEGF signalling on controlling the stemness (the ability to self 
renew and differentiate) of squamous skin tumours during the 
early stages of tumour progression. We show that CSCs of skin 
papillomas are localized in a perivascular niche, in the immediate 
vicinity of endothelial cells. Furthermore, blocking VEGFR2 
caused tumour regression not only by decreasing the microvascular 
density, but also by reducing CSC pool size and impairing CSC 
renewal properties. Conditional deletion of Vegfa in tumour 
epithelial cells caused tumours to regress, whereas VEGF over- 
expression by tumour epithelial cells accelerated tumour growth. 
In addition to its well-known effect on angiogenesis, VEGF affected 
skin tumour growth by promoting cancer stemness and symmetric 
CSC division, leading to CSC expansion. Moreover, deletion of 
neuropilin-1 (Nrp1), a VEGF co-receptor expressed in cutaneous 
CSCs, blocked VEGF’s ability to promote cancer stemness and 
renewal. Our results identify a dual role for tumour-cell-derived 
VEGF in promoting cancer stemness: by stimulating angiogenesis 
in a paracrine manner, VEGF creates a perivascular niche for CSCs, 
and by directly affecting CSCs through Nrp1 in an autocrine loop, 
VEGF stimulates cancer stemness and renewal. Finally, deletion of 
Nrp1 in normal epidermis prevents skin tumour initiation. These 
results may have important implications for the prevention and 
treatment of skin cancers. 

Skin squamous cell carcinoma is the second most frequent skin 
cancer in humans and affects about 500,000 new patients per year world- 
wide®. Mouse models for skin squamous tumours are almost identical to 
human skin cancers and thus offer an ideal model to study cancer 
initiation and growth’. The most extensively used mouse cancer model 
relies on chemically (9,10-dimethyl-1,2-benzanthracene (DMBA) and 
12-O-tetradecanoyl phorbol-13-acetate (TPA)) induced skin carcino- 
genesis protocol’. In this model, the number of papillomas is a measure 
of tumour initiation, whereas the growth rate is an indicator of tumour 
promotion. Recently, CD34-expressing tumour epithelial cells (TECs) 
isolated from DMBA/TPA-induced skin tumours were identified as 
CSCs, based on their increased clonogenic potential and ability to form 
secondary tumours upon transplantation into immunodeficient mice’. 
It has been suggested that the tumour microenvironment, in particular 


the vascular niche, can influence CSC function in brain tumours’, but 
whether CD34* CSCs are located ina vascular niche and whether these 
vessels regulate CSC function and stemness remain unknown. 

To assess whether CSCs of skin papillomas reside in a perivascular 
niche, we performed triple immunostaining for cutaneous CSC markers 
(see below) (CD34, Hmga2 and Krt13), TEC markers (K5 and K14) and 
endothelial cell markers (endoglin and CD31). Papillomas are well 
organized and differentiated tumours containing one or two layers of 
proliferating basal-like cells, expressing K5 and }4-integrin, which give 
rise to several layers of differentiated tumour cells expressing K1/K10 
and loricrin (Supplementary Fig. 1). As expected, skin CSCs co- 
expressed cutaneous CSC and TEC markers, but did not express 
endothelial cell markers and can thus be reliably distinguished from 
vascular endothelial cells (Fig. la, b and Supplementary Fig. 2). 
Interestingly, most CSCs were detected in close proximity to endothe- 
lial cells (within 30 um), whereas the more differentiated tumour cells 
were located more distantly from endothelial cells (Fig. 1a, b and 
Supplementary Fig. 2), indicating that CSCs of skin papilloma may 
reside in a vascular niche, similarly to what has been described for 
haematopoietic stem cells’®’’ and CSCs in brain tumours’. 

To determine the role of the vascular microenvironment in regulat- 
ing the stemness of skin tumours, we used an antibody-based strategy 
to inhibit angiogenesis by blocking VEGFR2 signalling in endothelial 
cells (Fig. 1c). In skin papillomas, VEGFR2 is almost exclusively 
expressed by endothelial cells and is not detectable in TECs (Fig. 1d, 
e and Supplementary Fig. 3). Administration of a blocking antibody 
against VEGFR2 (DC101)—a powerful inhibitor of tumour angio- 
genesis'*—to mice with established skin papillomas resulted in tumour 
regression and a significant decrease in endothelial cell prolifera- 
tion and tumour microvascular density (Fig. 1f-h and Supplemen- 
tary Fig. 4), consistent with previous anti-angiogenic reports of 
DC101 in human squamous cell carcinoma xenografts’*. Notably, DC101 
also reduced the proportion of CD34* CSCs, as determined by immu- 
nofluorescence (K5*CD34") and FACS analysis (Epcam* CD34"), 
and the decrease in tumour microvascular density closely correlated 
with the decrease in the CD34" CSC pool size (Fig. li, j and Sup- 
plementary Fig. 5). This decrease in CD34* TECs was not due to an 
increase in CSC apoptosis, but rather to a reduction in CSC prolifera- 
tion (Fig. 1k, | and Supplementary Fig. 6). We therefore explored 
whether VEGF might also affect CSCs directly using various genetic 
gain- and loss-of-function approaches. 

Deletion of Vegfa in epidermal cells prevents squamous skin tumour 
development'*”’, whereas transgenic overexpression of VEGF increases 
the number and accelerates the growth of papillomas after DMBA/TPA 
treatment'®*’”. Thus, VEGF is already known to regulate skin tumour 
initiation, but its role in regulating cancer stemness, and thereby 
skin tumour progression, remains unknown. Notably, quantitative 
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Figure 1 | A perivascular niche regulates cancer stemness in skin papilloma. 
a, Immunostaining for CD34, keratin 5 (K5) and endoglin (Endo). Hoechst 
nuclear staining is represented in blue. b, Distribution of the CD34* and 
CD34 TECs relative to the distance to endothelial cells; n = 8 papillomas. EC, 
endothelial cell. c, Protocol of DMBA/TPA and of anti- VEGFR2 blocking 
antibody administration. d, RT-PCR analysis of Vegfr2 mRNA expression in 
FACS-isolated tumour endothelial cells and CD34* TECs; n = 6 papillomas. 
e, Immunostaining for VEGFR2, endoglin and KS in a papilloma shows that 
VEGFR2 is only expressed by endothelial cells. f, Tumour size after 2 weeks of 
anti- VEGFR2 treatment; n = 36 papillomas. g, Immunostaining for endoglin 
and KS in papillomas treated with control IgG or anti- VEGFR2. h, Microvessel 
density; n = 12 tumours. i, Immunostaining for CD34, K5 and endoglin. str, 
tumour stroma. j, FACS quantification of CD34* TECs. k, Immunostaining for 
B4 integrin (B4), BrdU and endoglin. |, Proliferation of TECs as measured by 
BrdU incorporation in B4* TECs after treatment with control or anti- VEGFR2 
antibodies for 2 weeks; n = 12 papillomas. All scale bars represent 50 tim; all 
error bars represent s.e.m. 


polymerase chain reaction with reverse transcription (qRT-PCR) 
showed that Vegfa mRNA is expressed at higher levels in CD34* 
CSCs than in normal epidermal cells or CD34 TECs (Fig. 2a). To 
explore whether VEGF expression by TECs regulates the stemness 
properties of cutaneous CSCs and is required to sustain tumour 
growth, we deleted Vegfa in TECs by administrating tamoxifen to 
K14CreER: VEGF“ mice’* once they had established skin papillomas 
(Fig. 2b, c). As soon as 1 week after tamoxifen administration, most of 
the papillomas had shrunk by 60% and 2 weeks after, most of the 
papillomas had completely disappeared (Fig. 2d). As expected, 
tumour regression was associated with a decrease in endothelial cell 
proliferation and microvascular density (Fig. 2e, fand Supplementary 
Fig. 7). However, the magnitude of the antitumour effect suggested 
that inhibition of angiogenesis alone could not fully explain the com- 
plete disappearance of the established tumours. Indeed, the loss of 
VEGF in TECs also resulted in a marked reduction in the proportion 
of CD34* TECs (Fig. 2g, h), and a strong decrease in basal cell pro- 
liferation (Fig. 2i, j), indicating that VEGF secretion by TECs has an 
impact on cutaneous CSCs as well. 
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Figure 2 | VEGF expression by tumour cells is required for the maintenance 
of cutaneous cancer stem cells. a, qRT-PCR for Vegfa in normal 
interfollicular keratinocytes (IFE), CD34* TECs and CD34 TECs;n =6 
papillomas per condition. b, Genetic strategy used to study the role of VEGF-A 
expression by TECs. c, Protocol of DMBA/TPA and tamoxifen (TAM) 
administration. d, Tumour size 1 and 2 weeks after the beginning of tamoxifen 
administration (15 mg per week) in control and Vegfa conditional knockout 
(cKO) mice; n = 30. e, Proliferation of endothelial cells as determined by the 
proportion of Ki67*/endoglin* endothelial cells 2 weeks after tamoxifen 
administration; n = 6 tumours. f, Microvessel density 1 week after tamoxifen 
administration; n = 12 tumours. g, Immunostaining for CD34, K5 and 
endoglin in a representative papilloma from mice of each genotype. h, FACS 
quantification of CD34* TECs after tamoxifen administration; n = 6 
papillomas. i, Immunostaining for B4 integrin (4), endoglin and Ki67 in 
papillomas 1 week after tamoxifen administration. j, TEC proliferation as 
measured by the proportion of Ki67~/B4* TECs in papillomas from each 
genotype 2 weeks after tamoxifen administration; n = 12 tumours. 

k, Immunostaining for B4 integrin ({4, green) and NuMA (red) in papillomas. 
1, Quantification of the division planes as measured by the angle between the 
centrosomal axis (NuMa) and the basal membrane (f4-integrin) in control and 
Vefga cKO papillomas; n = 162 (vegfa cKO) and 258 (Ctrl) cells. For each panel 
showing immunostaining, Hoechst nuclear staining is represented in blue. All 
scale bars represent 50 jm; all error bars represent s.e.m. 


This hypothesis was further underscored by analysis of symmetric 
versus asymmetric cell division of the CSCs. The orientation of the 
mitotic spindle controls the balance between epidermal progenitor 
self-renewal (symmetric cell division) and differentiation (asymmetric 
cell division) during embryonic development'”. Analysis of the ori- 
entation of mitotic spindle in TECs revealed that approximately 40% of 
the mitosis events during tumour growth were symmetric in nature. 
Interestingly, deletion of Vegfa in TECs led to a significant decrease in 
the proportion of symmetric self-renewing cell divisions (Fig. 2k, 1). Loss 
of VEGF also resulted in a decrease in the number of layers of differ- 
entiated TECs, but no significant increase in apoptosis was observed 
(Supplementary Fig. 8). Thus, autocrine VEGF expression by TECs 
seems to be critical for the proliferation and renewal of cutaneous CSCs. 
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To establish further the critical autocrine role of VEGF, we also 
assessed whether increased VEGF production by TECs would increase 
CSC renewal. We therefore conditionally overexpressed VEGF spe- 
cifically in TECs by administering tamoxifen to K14CreER:Rosa26- 
VEGF-164 mice”, after they had already formed skin papillomas 
(Fig. 3a, b). Tamoxifen administration enhanced VEGF expression 
by TECs in K14CreER:Rosa26-VEGF-164 mice, and resulted in accel- 
erated tumour growth. As expected, this tumour-promoting effect of 
VEGF was accompanied by a strong increase in endothelial cell 
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Figure 3 | Increased VEGF expression promotes cutaneous cancer-stem-cell 
expansion. a, Genetic strategy used to study the consequences of VEGF-A 
overexpression in TECs. b, Protocol of DMBA/TPA and tamoxifen 
administration. c, Tumour size 2 weeks after the first tamoxifen administration 
(10 mg per week); n = 40 tumours. d, Microvessel density 2 weeks after 
tamoxifen administration; n = 15 tumours. e, Immunostaining for K5, CD34 
and endoglin. f, FACS quantification of the proportion of CD34 TECs 2 weeks 
after tamoxifen administration. g, h, Immunostaining and quantification of 
Ki67* and CD34* TECs in control and VEGF-164 overexpressing papillomas; 
n= 10 tumours. i, Quantification of the division planes in control and VEGF- 
164 overexpressing papillomas; n = 348 (VEGF-164) and 192 (Ctrl) cells 

2. weeks after tamoxifen administration. j, Venn diagram from microarray 
comparing CD34" versus CD34~ TECs (CD34 signature) and control versus 
VEGF-164 CD34" TECs (VEGF signature). k, mRNA expression of ten genes 
of the CD34 signature upregulated by VEGF-164 3 weeks after tamoxifen 
administration; n = 3 tumours. l, FACS quantification of the proportion of 
CD133* TECs after VEGF-164 overexpression; n = 6 tumours. 

m, n, Immunostaining for CD34 and keratin 13 (K13) or Hmga2 in papillomas 
2 weeks after tamoxifen administration. Hoechst nuclear staining is 
represented in blue; all scale bars represent 50 jum; all error bars represent s.e.m. 
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proliferation and microvascular density (Fig. 3c, d and Supplementary 
Fig. 9). VEGF overexpression in TECs also caused a major expansion 
of the CD34* CSC population, increased the proliferation of CD34* 
CSCs, and the proportion of symmetric CSC division (Fig. 3e-i and 
Supplementary Fig. 10), altogether expanding the CD34* CSC pool. 
Moreover, even though VEGF overexpression did not completely 
abrogate tumour cell differentiation, it did significantly delay tumour 
cell differentiation in vitro and in vivo (Supplementary Figs 10 and 14). 
Altogether, increased VEGF production by TECs enhanced tumour 
growth, not only by stimulating angiogenesis (and the formation of a 
vascular niche for CSCs) but also by promoting CSC renewal. 

To underscore further the notion that VEGF altered the stemness 
properties of CD34" CSCs and to determine the molecular mechan- 
isms of how VEGF promotes the renewal of CD34* CSCs, we 
transcriptionally profiled control and VEGF-overexpressing, FACS- 
isolated, CD34” CSCs to determine any possible ‘VEGF signature’. 
VEGF upregulated approximately 20% of the genes preferentiall ly 
expressed by CD34* CSCs as compared to CD34~ TECs (CD34 
signature), including genes known to regulate stemness (for example, 
Sox2, Hmga2), proliferation (Ccnd1, Ccng1) and the interaction of 
tumour cells with their microenvironment (Colla1, Col3a1), as well 
as genes known to be expressed in squamous cell carcinoma (Krt13, 
Ano1) (Fig. 3j, k, Supplementary Table 1 and Supplementary Fig. 11). 
Using qRT-PCR, FACS and immunostaining, we confirmed that 
around 50% of these genes were upregulated by VEGF (Fig. 3k-n 
and Supplementary Fig. 11). Thus, VEGF altered the transcriptional 
signature of CD34* CSCs in a manner that would be expected to 
promote cancer stemness. Consistent herewith, transplantation of a 
limiting dilution of CD34" TECs isolated from VEGF-overexpressing 
mice into immunodeficient mice increased the number of grafts with 
tumours compared to control CD34* CSCs, whereas the opposite 
phenotype was observed when CD34* VEGF-deficient CSCs were 
grafted (Supplementary Fig. 12). Thus, VEGF qualitatively altered 
the stemness properties of CSCs. 

It has been shown recently that deletion of VEGFRI (flt1) in 
epidermal cells delays the appearance of skin papilloma in K5-Sos 
transgenic mice", indicating that VEGF signalling in keratinocytes 
can directly regulate the initiation of these skin tumours. To explore 
whether VEGF signalling could affect tumour progression by directly 
regulating cutaneous CSC properties, we sought to design an experi- 
mental strategy to impair VEGF signalling in TECs. We first analysed 
which of the VEGF receptors were expressed in CSCs from skin 
papillomas. qRT-PCR and immunostaining showed that neuropilin 
1 (Nrp1) was expressed at the highest level by CD34* CSCs (Fig. 4a 
and Supplementary Fig. 13). Besides its well-known role as co-receptor 
of the VEGF receptors in endothelial cells, NRP1 is also expressed in 
various human carcinoma cell lines and primary tumours”. 
Furthermore, overexpression of Nrp1 promotes the growth of cancer 
cell lines whereas anti-Nrp1 blocking antibodies can inhibit tumour 
growth”. However, it remains unclear whether Nrp1 can control 
tumour initiation and stimulate tumour progression by promoting 
VEGF receptor signalling in endothelial cells or, additionally, by 
directly regulating VEGF signalling in TECs as previously suggested”. 

To explore the role of Nrp1 in skin tumorigenesis, we performed, as 
a first step, a constitutive conditional deletion of Nrp1 in the epidermis 
(K14-Cre:Nrp#”~ mice, Fig. 4b). Mice lacking Nrp] in their epidermis 
are viable, develop normally and have normal skin (Supplementary 
Fig. 13). Chemical skin carcinogenesis was then initiated in Nrp1- 
deficient mice and their respective littermate controls. After DMBA/ 
TPA treatment, 100% of the control mice developed skin papilloma 
after 25 weeks of TPA treatment, whereas, notably, mice lacking Nrp1 
in their epidermis did not develop any papilloma at this time point 
(Fig. 4c, d). These results indicate that Nrp1 expression in epidermal 
cells is critical for skin tumour initiation. 

To assess whether Nrp1 might directly regulate cutaneous tumour 
stemness, independently of an effect on the vascular niche, we 
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Figure 4| Nrp1 is required for squamous tumour initiation and regulates 
stemness of cutaneous cancer cells. a, RT-PCR for Nrp1 in normal 
interfollicular keratinocytes (IFE), CD34" hair follicle bulge stem cells, CD34 
TECs and CD34* TECs;n = 3 tumours. b, Genetic strategy used to study the cell- 
autonomous role of Nrp1 in skin tumour initiation (K14Cre:Nrp”_).c, Percentage 
of control and Nrp1 cKO mice presenting skin tumours. d, Average number of 
tumours per mouse 40 weeks after tumour initiation; Ctrl, n = 9; Nrp1 cKO, n = 11 
mice. e, Genetic strategy used to study the role of VEGF-164 and Nrp1 in pre- 
established skin tumour. f, Protocol of DMBA/TPA and tamoxifen administration. 
g, Tumour size (n = 30 tumours) 3 weeks after tamoxifen administration. 

h, Microvessel density (n = 15 tumours) 3 weeks after tamoxifen administration. 
i, Immunostaining for K5, CD34 and endoglin. j, FACS quantification of the 
proportion of CD34* TECs 3 weeks after tamoxifen administration (n = 9 
papillomas). k, Quantification of the proportion of Ed(U*CD34* TECs (n = 9 
tumours) 3 weeks after tamoxifen administration in control, VEGF-overexpressing 
and VEGF-overexpressing Nrp1-deficient TECs. 1, Quantification of the division 
planes in control and VEGF-164-overexpressing, Nrp1-deficient papillomas 

3 weeks after tamoxifen administration; n = 274 for VEGF-164/Nrp”~ and 

n = 224 for control mice. m, Cell growth of FACS-isolated CD34" and CD34— 
TECs after 1 week of in vitro culture and treated with indicated molecules; n = 9 
replicates. n, mRNA expression of 10 genes of the CD34 signature upregulated by 
VEGF, from control, VEGF-164 and VEGF-164/Nrp1 cKO papillomas 3 weeks 
after tamoxifen administration; n = 3 tumours. 0, Model of the dual role of VEGF in 
regulating cutaneous cancer stemness. Hoechst nuclear staining is represented in 
blue. All scale bars represent 50 jum; all error bars represent s.e.m. 
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simultaneously deleted Nrp1 and overexpressed VEGF selectively in 
TECs (Fig. 4e, f). Compared to VEGF overexpression in Nrp1*’~ cells 
or VEGF overexpression in wild-type cells, VEGF overexpression in 
Nrpi'’~ TECs no longer accelerated tumour growth (Fig. 4g). 
This inefficiency of VEGF was not due to an unresponsiveness of 
endothelial cells to paracrine VEGF, as VEGF overexpression by 
Nrp1-deficient TECs still increased tumour angiogenesis (Fig. 4h 
and Supplementary Fig. 14). However, by contrast, deletion of Nrp1 
in VEGF-overexpressing TECs abrogated the ability of VEGF to 
stimulate tumour cell proliferation, symmetric cell division and expan- 
sion of the CD34* CSC pool (Fig. 4i-1), indicating that Nrp1, ina cell- 
autonomous manner, mediated the response of CSCs to autocrine 
VEGF. 

To investigate this molecular mechanism further, we cultured FACS- 
isolated CD34* and CD34” TECs in vitro and exposed them to VEGF, 
in the presence or not of anti-Nrp1 blocking antibodies”. VEGF sti- 
mulated the proliferation of CD34* but not CD34” TECs, and this 
effect was suppressed by an anti-Nrp1 antibody that blocks its binding 
to VEGF, demonstrating that Nrp1 cell-autonomously mediated the 
effect of VEGF on CD34" CSC proliferation. This effect was specific for 
VEGF-Nrpl, as an anti-Nrpl antibody that blocks the Nrpl- 
semaphorin interaction was ineffective (Fig. 4m). Comparative qRT- 
PCR analysis of FACS-isolated CD34* CSCs expressing or lacking 
Nrp1 showed that Nrp1 was required for the upregulation of most 
stemness and proliferation genes in response to VEGF (Fig. 4n and Sup- 
plementary Fig. 15). Thus, autocrine production of VEGF and Nrp1 
expression by the TECs regulates the stemness of cutaneous tumours 
and the expansion of the CSC pool, which, together with the pro- 
angiogenic activity of VEGF, contribute to enhanced tumour growth. 

Our results indicate that VEGF has a dual role in the initial stage of 
skin tumour promotion. First, VEGF signalling through VEGFR2 in 
endothelial cells is critical to sustain angiogenesis and to create a 
vascular niche for CSCs. Whether endothelial cells in the perivascular 
niche produce angiocrine signals to foster stemness remains to be 
determined. Second, VEGF also acts directly on epidermal cells and 
cutaneous CSCs through a cell-autonomous mechanism depending on 
Nrp1, and hereby also increases the stemness and renewal potential of 
CSCs; this autocrine loop contributes to skin tumour initiation and CSC 
expansion in early skin tumours (Fig. 40). Such a dual role for VEGF in 
promoting stemness of CSCs in vivo by regulating both the vascular 
microenvironment and intrinsic stem-cell properties has important 
implications for the prevention and treatment of epithelial cancers. 


METHODS SUMMARY 


Tumour formation was performed by DMBA/TPA administration. Inhibition of 
VEGFR2 was performed by administrating anti- VEGFR2 antibodies to mice bearing 
papillomas. Gain and loss of VEGF and Nrp1 function in skin tumours was per- 
formed by administrating tamoxifen to K14CreER: VEGP’ , K14CreER:RosaVEGF- 
164 and K14CreER:RosaVEGF-164:Nrp¥”~ mice bearing skin tumours. The 
measurement of tumour size, cell proliferation, plane of cell division, apoptosis, 
differentiation and frequency of cancer stem cells were performed as described in 
Methods. 

After enzymatic digestion of the tumours, CD34* TECs were isolated or quan- 
tified by FACS using a combination of monoclonal antibodies. Secondary tumour 
assays were performed by transplanting CD34 TECs into immunodeficient mice 
treated with tamoxifen. mRNA expression was quantified by Affymetrix Mouse 
Genome 430 2.0 Array or real-time RT-PCR using the primers and protocol 
indicated in Methods. FACS-isolated CD34" and CD34” TECs were cultured 
in vitro and treated with VEGF, anti-VEGF and anti-Nrp1 blocking antibodies. 
For further details see Supplementary Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. FVB/N mice were obtained from Charles River Laboratories. K14Cre (ref. 
26) and K14CreER (ref. 27) transgenic mice were provided by E. Fuchs. K14CreER 
and K14Cre mice were mated with Rosa26-VEGF-164, Vefe" 1 (ref. 28) and 
Nrp1'’~ (ref. 29) mice. Mouse colonies were maintained in a certified animal 
facility in accordance with European guidelines. For each time point, at least three 
mice from two different litters were used to characterize the different phenotypes. 

For carcinogenesis, mice were treated with DMBA and TPA as previously 

described*®. Briefly, mice from the FVB/N background and from each Cre or 
CreER line were treated with DMBA at postnatal day 23, 25 and 27 with DMBA 
(9,10-dimethyl-1,2-benzanthracene) and then treated twice, weekly for several 
weeks with TPA (12-O-tetradecanoyl phorbol-13-acetate). Seven-week-old mice 
were treated again twice with DMBA. After 25 weeks of treatment, more than 99% 
of the mice of each genotype develop skin tumours except for the Nrp1 cKO mice, 
which fail to develop tumours. After 25 weeks, mice were treated with antibodies or 
tamoxifen and the size of tumours was followed by direct measurement. 
Measurement of tumour growth. Skin tumours were measured using a precision 
calliper allowing discrimination to size modifications >0.1 mm. Tumour volumes 
were measured the first day of treatment and every week until the day that they 
were humanely killed with the formula V = n[d* X D]/6, where d is the minor 
tumour axis and D is the major tumour axis. 
Histology, immunostaining and imaging. Tissues from FVB/N, CreER and Cre 
mice were either embedded in OCT and sections were fixed in 4% paraformalde- 
hyde for 10 min at room temperature, or pre-fixed for 2 h in 4% paraformaldehyde 
and embedded in OCT. Samples were sectioned at 4-6-j1m sections using a 
CM3050S Leica cryostat (Leica Microsystems). 

The following primary antibodies were used: anti-CD34 (rat, 1:100, BD), anti- 
K5 (rabbit, 1:2,000, Covance), anti-B4 (rat, 1:200, BD Biosciences), anti-Ki67 
(rabbit, 1:200, Abcam), anti-E-cadherin (rat, 1:1,000, Invitrogen), anti-K10 
(rabbit, 1:2,000, Covance), anti-endoglin (goat, 1:500, R&D), anti-Nrp1 (goat, 
1:100, R&D), anti- VEGFR2 (rat, 1:100, eBioscience), anti-active caspase 3 (rabbit, 
1:600, R&D) anti-BrdU (Rat, 1:100, BD), anti-K13 (mouse, 1:100, Abcam), anti- 
Hmga2 (rabbit, 1:500, Santa Cruz). EdU staining was performed following the 
manufacturer’s instructions (Invitrogen). BrdU or EdU were injected 4h before 
mice were humanely killed. Sections were incubated in blocking buffer (PBS/NDS 
5%, BSA 1%, Triton 0.2%) for 1h at room temperature. Primary antibodies were 
incubated overnight at 4 °C. Sections were rinsed three times in PBS and incubated 
with secondary antibodies diluted at 1:400 for 1h at room temperature. The 
following secondary antibodies were used: anti-rabbit, anti-rat, anti-goat conju- 
gated to Alexa Fluor 488 (Molecular Probes), to rhodamine Red-X (Jackson 
ImmunoResearch) or to Cy5 (Jackson ImmunoResearch). Nuclei were stained 
in Hoechst solution (1:1,000) and slides were mounted in DAKO mounting 
medium supplemented with 2.5% Dabco (Sigma). 

Pictures were acquired using Axio Imager M1 Microscope, AxioCamMR3 
camera and using Axiovision software (Carl Zeiss). 

Isolation of tumour epithelial cells. Tumours from FVB/N mice and mice from 
each Cre line were digested in collagenase I (Sigma) for 2h at 37 °C on a rocking 
plate. Collagenase I activity was blocked by addition of EDTA (5 mM) and then 
rinsed in PBS supplemented with 2% FCS. After tumour digestion, cells were first 
incubated in PBS complemented with 30% FCS to block Fc receptors for 15 min at 
room temperature. Immunostaining was performed using biotin-conjugated anti- 
CD34 (clone RAM34; BD Pharmingen), FITC-conjugated anti-«6-integrin (clone 
GoH3, BD Pharmingen), PE-conjugated anti-CD45 (clone 30F11, eBioscience), 
PE-conjugated anti-CD31 (clone MEC13.3, BD Pharmingen), PE-conjugated 
anti-CD140a (clone APAS, eBiosciences), and APC-Cy7-conjugated anti-Epcam 
(clone G8.8, Biolegend) by incubation for 30min on ice. A biotin-coupled 
VEGER2 antibody (clone avas12al, eBioscience) was used to determine the 
expression of VEGFR2 on the different cell populations. Cells were washed and 
stained using APC-conjugated Streptavidin (BD Pharmingen) for 20 min on ice. 
Living tumour cells were selected by forward scatter, side scatter and by Hoechst 
dye exclusion. Fluorescence-activated cell sorting analysis was performed using 
FACSAria and FACSDiva software (BD Biosciences). Sorted cells were collected 
either in culture medium for in vitro culture experiments or directly in the lysis 
buffer provided by the manufacturer (Microprep kit, RNAeasy, Stratagene), and 
RNA extraction was performed according to the manufacturer’s protocol. The 
entire procedure was repeated in at least three biologically independent samples. 
Microarray analysis. Total RNA was analysed using mouse whole-genome 430 2.0 
array from Affymetrix at the VIB microarray facility. Three different biological 
samples from CD34* and CD34 TECs were analysed and we compared 
CD34* to CD34” TECs. The merge of the genes upregulated (>1.5 times) in 
two among three samples provided a list of 526 genes described in the manuscript 
as the ‘CD34~ signature’. The same analysis was performed on CD34* TECs 
isolated by FACS from K14CreER:Rosa-VEGF-164 and control mice. The genes 


upregulated (>2 times) in VEGF-164 CD34* TECs compared to control CD34* 
TECs provided a list of 938 genes (the VEGF signature). The merge of the two 
comparisons—CD34" signature and VEGF signature—highlighted a list of 122 
common genes. Analysis of the microarray was performed by the VIB microarray 
facility. 

Reverse transcription and quantitative PCR (qPCR). Total RNA extraction and 
DNase treatment of samples were performed using the microprep kit (Stratagene) 
according to the manufacturer's recommendations. After nanodrop quantifica- 
tion, purified RNA was used to synthesize the first strand cDNA in a 50 pl final 
volume, using Superscript II (Invitrogen) and random hexamers (Roche). Control 
of genomic contaminations was measured for each sample by performing the same 
procedure with or without reverse transcriptase. qPCR analysis was performed 
with 4ng of cDNA reaction as a template, using a Brilliant II Fast SYBR QPCR 
Master Green mix (Stratagene) and an Agilent technologies Stratagene Mx3500P 
real-time PCR system. Relative quantitative RNA was normalized using the house- 
keeping genes f}-actin and Hprt. Primers were designed using Lasergene 7.2 soft- 
ware (DNAStar) and are presented below. Analysis of the results was performed 
using Mxpro software (Stratagene) and relative quantification was performed 
using the AAC, method using f-actin as a reference. The entire procedure was 
repeated in three biologically independent samples. Error bars represent standard 
error of the mean (s.e.m.). Results were presented as the fold change over CD34 
TECs isolated from the same tumours. 

List of primers used. Vegfr2 mRNA forward 5'-cagtggtactggcagctagaag-3’, 
reverse 5’-acaagcatacgggcttgttt-3’; Vegfa mRNA forward 5’-aaaaacgaaagcgcaag 
aaa-3', reverse 5’ -tttctccgctctgaacaagg-3’; Krt5 (K5) mRNA forward 5’-cagagctg 
aggaacatgcag-3’, reverse 5'-cattctcagccgtggtacg-3'; Cdh3 mRNA forward 5’- 
ccttggagetggaageaact-3’, reverse 5'-tgtccagccaaggctacttt-3’; Cdh] mRNA forward 
5'-cagaatgacaacaggccaga-3’, reverse 5'-ttcatcacggagettcctg-3'; Krt] mRNA for- 
ward 5’-tttgcctccttcatcgaca-3’, reverse 5'-gttttgegtccgggttgt-3’; Krt10 mRNA for- 
ward 5'-cgtactgttcagggtctggag-3’, reverse 5’-gcttccagcgattgtttca-3’; Nrp] mRNA 
forward 5'-ccacacacagtgggcttg-3’, reverse 5'-ggtccagctgtagetgcttc-3'; CD34 
mRNA forward 5’-gcaccactggttatttcctga-3’, reverse 5’-ttttcttcccaacagccatc-3’; 
TnC mRNA forward 5’-gggctatagaacaccgatge-3', reverse 5’-catttaagtttccaat 
ttcaggttc-3'; Epha7 mRNA forward 5’-ttaaaattgagcgtgtgattge-3’, reverse 5’- 
tcaaacgaccactgcaaact-3’; Aen mRNA forward 5'-acgtgaaacctgggaaagc-3’, reverse 
5'-agccagctcacaggttgg-3'; Ank3 mRNA forward 5’-cgaatgtcaacctgagcaataa-3’, 
reverse 5'-ccacattcactcggtcttctt-3’; Anln mRNA forward 5'-acaatccaagga 
caaacttgc-3’, reverse 5'-gcgttccaggaaaggctta-3’; Anol mRNA forward 5’- 
accaaggccaagtacagcat-3’, reverse 5’-tgcagctgagtatacgccatt-3'; Anxa3 mRNA for- 
ward 5’-aggctgatctttcaccttcg-3’, reverse 5'-aagccgagatcacagcaatc-3'; Atp10d 
mRNA forward 5’-gaggtggtgaaattgettcg-3’, reverse 5'-atcattggcaccgtcacc-3'; 
Cadm4 mRNA forward 5’-tgtgctgtcacagggaacc-3’, reverse 5'-caggccaggtagcgt 
gag-3', Camkld mRNA forward 5'-cctctacctggtcatgcaact-3’, reverse 5’- 
ctgtgtaaaaccccttctcca-3’; Ccna2 mRNA forward 5'-cttggctgcaccaacagtaa-3’, 
reverse 5'-caaactcagttctcccaaaaaca-3’; Cend1 mRNA forward 5'-gagattgtgccatc- 
catgc-3’, reverse 5’-ctcctcttcgcacttctgct-3'; Ceng] mRNA forward 5'-tggaca 
gattcttgtctaaaatgaag-3’, reverse 5'-cagtgggacattcctttcctc-3'; CD133 mRNA 
forward 5'-gaaggagcccagcttagagg-3’, reverse 5'-ggtcattcactcaaagtaccatcc-3'; 
Cdcl4b mRNA forward 5’-acttcgggccctgaaaag-3’, reverse 5'-tcagcacagct 
agggegacat-3'; Krt13 (K13) mRNA forward 5’ -agtcccagctgagcatgaa-3’, reverse 5'- 
gatgagcccctggatctgt-3'; Cdc5l mRNA forward 5’ -aacgcagtggaggaccatt-3’, reverse 
5'-gtccctttgggcagttttg-3’;  Cdkn2a mRNA forward _ 5’-gggttttcttggtga 
agttcg-3’, reverse 5'-ttgcccatcatcatcacct-3'; Cep55 mRNA forward 5’-tttcggct 
cctttgaacttg-3', reverse 5’-tctggaactcactacgtaacttgg-3'; Cldn4d mRNA forward 5’- 
gagggctgeegacctaga-3’, reverse 5'-gcaagacagtgcggaaaag-3'; Crabp2 mRNA for- 
ward 5'-ttgaggaaatgctaaaagctctg-3’, reverse 5'-tcctgtttgatctcgactgct-3'; Cyp4f39 
mRNA forward 5’-aaagaagaaagccaagagaattga-3’, reverse 5’-tgcaacaggtagtcggtg 
ag-3'; Ddit4l mRNA forward 5'-ctggtggtctccccacac-3', reverse 5'-tcattgcagt 
aagaggcacact-3'’; Dmrta2 mRNA forward 5'-agtctttggctcggtttge-3', reverse 5’- 
ttctgcaatttggcctctg-3'; Ecm1 mRNA forward 5’-tccagagcagccttgatctt-3’, reverse 
5'-atctctegctggtctgaage-3'; Eomes mRNA forward 5’-accggcaccaaactgaga-3’, 
reverse 5'-aagctcaagaaaggaaacatgc-3'; Ephx3 mRNA forward 5’ -tcccatgtcagtgatcc 
aag-3’, reverse 5'-tggaagtcagacatagacaacagc-3'; Esm1 mRNA forward 5’-cagtatge 
agcagccaaatc-3', reverse 5'-gatgctgagtcacgctctgt-3'; Exoc4 mRNA forward 5’- 
tgaccaacatcaccatgtca-3', reverse 5'-gctgtgttgtacagcatctcg-3’; Fads2 mRNA for- 
ward 5’-attcgggagaagatgctacg-3’, reverse 5'-aagaacttgcccacgaagtc-3'; Fam129a 
mRNA forward 5’-tacggctcatgggaaatgac-3’, reverse 5'-gctcctccatcaccagctta-3’; 
Farl mRNA forward 5'-cactgtgcggctactgtaaga-3', reverse 5'-agctgtcgtgtagctatc 
acattt-3'; Gpx2 mRNA forward 5’-gttctcggcttcccttgc-3’, reverse 5’-ttcaggatctc 
ctegttctga-3'; Grhl3 mRNA forward 5'-aaggaagatgtcgaatgaacttg-3’, reverse 5’- 
tegtcctcattactgtagggaaa-3'; Gstal mRNA forward 5’-cttctgacccctttccctct-3’, 
reverse 5'-gctgccaggctgtaggaac-3'; Gsta2 mRNA forward 5'-cagagtccggaag 
atttgga-3’, reverse 5'-agaatggctctggtctgcac-3'; Hmga2 mRNA forward 5'-aaggc 
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agcaaaaacaagagc-3’, reverse 5’-ccgtttttctccaatggtct-3'; Igf2bp2 mRNA forward 
5'-gggaaaatcatggaagttgacta-3’, reverse 5'-cgggatgttccgaatctg-3'; Kenn4 mRNA 
forward 5'-tctgcacgctgagatgttgt-3’, reverse 5’-accaggagcaggtacagcac-3'; Krt18 
mRNA forward 5'-agatgacaccaacatcacaagg-3', reverse 5’-tccagaccttggacttcctc-3’; 
Krt42 mRNA forward 5’ -tgcagatcgagagcctga-3’, reverse 5'-acgaagggcattcatttcc-3’; 
Lrrfipl1 mRNA forward 5'-gtttgccgaagtgaaagagg-3’, reverse 5'-aatttaggatt 
attccatgtttctcg-3'; Neto2 mRNA forward 5’-tccaccaaacaaggagtgtatct-3', reverse 
5'-agcgttcatcaaaggtcagc-3'; Nrp2 mRNA forward 5'-atggctggacacccaattt-3’, 
reverse 5'-atggttaggaagcgcaget-3'; Phida2 mRNA forward 5'-tcagcgctctgagtctg 
aaa-3', reverse 5’-tgggctcctgtctgatgc-3'; Prkg2 mRNA forward 5’-tttggcaaaaggc 
aaggt-3’, reverse 5’-tcaactgtggctgatcatgg-3'; Ptgr1 mRNA forward 5'-gactgagctcc 
cacccttaaa-3’, reverse 5'-gtaaggatccacagagaggaaca-3'; Ptprzl1 mRNA 5'-ggcact 
caggagtatccaaca-3’, reverse 5’-gaccaatacgagactcatggcta-3’; Rapgef3 mRNA for- 
ward 5'-ggtcaattctgccggtgat-3’, reverse 5’-gttgagccccagggatgt-3'; Sox2 mRNA 
forward 5’-tccaaaaactaatcacaacaatcg-3’, reverse 5'-gaagtgcaattgggatgaaaa-3'; 
Spp1 mRNA forward 5’-ggaggaaaccagccaagg-3’, reverse 5'-tgccagaatcagtc 
actttcac-3'; Krt14 mRNA forward 5’ -atcgaggacctgaagagcaa-3’, reverse 5’-tcgatctg 
caggaggacatt-3'; CD45 mRNA forward 5’-aatggctcttcagagaccacata-3’, reverse 
5' -agtcaggctgtgggeaca-3'; CD31 mRNA forward 5'-cggtgttcagcgagatcc-3’, reverse 
5'-cgacaggatggaaatcacaa-3'; pdgfra mRNA forward 5’-gtcgttgacctgcagtgga-3’, 
reverse 5'-ccagcatggtgatacctttgt-3'; CD11b mRNA forward 5'-aaggatgctggg 
gaggtc-3', reverse 5'-gtcataagtgacagtgctctgga-3'; vim mRNA forward 5’-cca 
accttttcttccctgaa-3’, reverse 5'-tgagtggetgtcaaccagag-3'; tek (tie2) mRNA forward 
5'-cataggaggaaacctgttcacc-3’, reverse 5'-cccacttctgagcttcacatc-3’. 

Culture of tumour epithelial cells. 3T3 feeder cells were y-irradiated and plated 
on 12-well plates. The next day, sorted CD34* and CD34~ epithelial tumour cells 
were plated on wells pre-cultured with 3T3 feeders cells in keratinocyte medium 
(MEM medium supplemented with 15% FBS, 0.4ugml~’ hydrocortisone, 
5 Ug ml insulin, 10 ng ml! EGE, 2X 10-°M T3, 1% penicillin/streptomycin, 
2mM L-glutamine). Twenty-four hours later, the cells were left untreated (control) 
or treated with VEGF (50 ng ml ', R&D), anti- VEGF-A (20 lug ml ', R&D) or 
anti-Nrp1 blocking antibodies (10 1g ml‘, Genentech). We used two distinct 
Nrp1-blocking antibodies: anti-Nrp1a, which blocks binding of semaphorins, and 
anti-Nrp1b, which blocks binding of VEGF-A. The medium was changed every 
other day for at least 1 week. At the end of the treatment, the feeders were blasted 
using PBS + EDTA (1 mM). The keratinocytes were trypsinized and counted. For 
induction of differentiation, untreated CD34° clones were cultured overnight in 
keratinocyte medium supplemented with only 2% of serum after blasting of the 
feeder cells. The next day, the cells were cultured in 2% serum medium for 48h 
with 1.7mM Ca’* to induce differentiation or kept in a low Ca’* medium 
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(0.07 mM). The keratinocytes were then trypsinized and RNA extraction was 
performed as described above. 

ELISA for VEGF-A. Control and K14CreER:Rosa26- VEGF-164 mice were killed 
lweek after tamoxifen injection (15mg per week) and tumours were excised 
and weighted before being dissociated mechanically at 4 °C in PBS supplemented 
with a cocktail of protease inhibitors (Roche Diagnostics). Tumour lysates were 
centrifuged at 13,000 r.p.m. min! for 5 min and VEGE-A concentration was 
assessed in the supernatant by ELISA (R&D Systems) following the manufac- 
turer’s recommendations. 

Secondary tumour assays. CD34~ TECs were isolated by FACS from control, 
K14CreER:Rosa26-VEGF-164 and K14CreER: VEGF" mice 1 week after tamoxifen 
injection (15 mg per week). Cells were harvested in 4°C medium supplemented 
with 20% serum and 1% penicillin/streptomycin. Cells were then washed in PBS 
complemented with 1% penicillin/streptomycin and re-suspended in matrigel 
(Sigma). 1,000 CD34* TECs re-suspended in 50 Ll of matrigel were injected sub- 
cutaneously to Nude mice (Charles River). Three injections per mouse were per- 
formed. Secondary tumours were detected by palpation and mice were killed 
3 weeks after grafting when tumours reached 50 mm’ to perform immunostainings. 
Symmetry of cell division. To determine the plane of cell division, we performed 
co-immunostaining for 4 integrin, which marks the basal lamina, and NuMA 
(Novus), which marks the pair of centrosomes at the opposite side of cells in late 
prophase/metaphase. Measurement of the angle formed by the basal lamina and 
the centrosomes allows us to determine whether mitoses are symmetric (0-40°) or 
asymmetric (50-90°). Angles between 40° and 50° have been counted and clas- 
sified as ‘undetermined’ to limit false-positive results. 

Statistics. Data represent mean + s.e.m. Statistical significance was calculated by 
Student’s t-test, ANOVA or Fisher’s exact test where indicated (Origin 7.0), con- 
sidering P < 0.05 as statistically significant. 
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Inhibition of miR-33a/b in non-human primates 
raises plasma HDL and lowers VLDL triglycerides 


Katey J. Rayner’, Christine C. Esau’, Farah N. Hussain!, Allison L. McDaniel’, Stephanie M. Marshall’, Janine M. van Gils!, 
Tathagat D. Ray’, Frederick J. Sheedy’, Leigh Goedeke', Xueqing Liu’, Oleg G. Khatsenko’, Vivek Kaimal*, Cynthia J. Lees*, 
Carlos Fernandez-Hernando!, Edward A. Fisher!, Ryan E. Temel** & Kathryn J. Moore!* 


Cardiovascular disease remains the leading cause of mortality in 
westernized countries, despite optimum medical therapy to reduce 
the levels of low-density lipoprotein (LDL)-associated cholesterol. 
The pursuit of novel therapies to target the residual risk has 
focused on raising the levels of high-density lipoprotein (HDL)- 
associated cholesterol in order to exploit its atheroprotective 
effects’. MicroRNAs (miRNAs) have emerged as important post- 
transcriptional regulators of lipid metabolism and are thus a new 
class of target for therapeutic intervention’. MicroRNA-33a and 
microRNA-33b (miR-33a/b) are intronic miRNAs whose encoding 
regions are embedded in the sterol-response-element-binding 
protein genes SREBF2 and SREBFI (refs 3-5), respectively. These 
miRNAs repress expression of the cholesterol transporter ABCA, 
which is a key regulator of HDL biogenesis. Recent studies in mice 
suggest that antagonizing miR-33a may be an effective strategy for 
raising plasma HDL levels** and providing protection against 
atherosclerosis®; however, extrapolating these findings to humans 
is complicated by the fact that mice lack miR-33b, which is present 
only in the SREBF1 gene of medium and large mammals. Here we 
show in African green monkeys that systemic delivery of an anti- 
miRNA oligonucleotide that targets both miR-33a and miR-33b 
increased hepatic expression of ABCA1 and induced a sustained 
increase in plasma HDL levels over 12 weeks. Notably, miR-33 
antagonism in this non-human primate model also increased the 
expression of miR-33 target genes involved in fatty acid oxidation 
(CROT, CPT1A, HADHB and PRKAAI) and reduced the expres- 
sion of genes involved in fatty acid synthesis (SREBF1, FASN, 
ACLY and ACACA), resulting in a marked suppression of the 
plasma levels of very-low-density lipoprotein (VLDL)-associated 
triglycerides, a finding that has not previously been observed in 
mice. These data establish, in a model that is highly relevant to 
humans, that pharmacological inhibition of miR-33a and miR- 
33b is a promising therapeutic strategy to raise plasma HDL and 
lower VLDL triglyceride levels for the treatment of dyslipidaemias 
that increase cardiovascular disease risk. 

Recent advances in the understanding of lipid metabolism have 
revealed that the genetic loci encoding the transcription factors 
SREBP1 and SREBP2 (known as SREBF1 and SREBF2) also encode 
the miRNAs miR-33b and miR-33a, respectively, which regulate 
cholesterol and fatty acid homeostasis together with their host 
genes**”~, Although miR-33a and miR-33b differ by two nucleotides 
in their mature form, they are identical in their seed sequence and thus 
are predicted to repress the same subset of genes. Notably, miR-33a 
has been highly conserved throughout evolution, whereas miR-33b is 
encoded only by the SREBF1 gene of medium and large mammals. 
This difference between mice and humans may be particularly relevant 
under conditions in which the transcription of SREBF1, and thus 


miR-33b, is highly upregulated, such as insulin resistance’. Recently, 
we and others have reported that the silencing of mature miR-33a in 
mice, by using modified antisense oligonucleotides*®, by viral delivery 
of hairpin inhibitors** or by targeted deletion of the miR-33-encoding 
locus”®, increased the levels of hepatic ABCA1 and circulating HDL by 
as much as 40%. Although these studies highlight the therapeutic 
promise of miR-33 inhibitors for raising plasma HDL levels, the 
absence of miR-33b in mice limits the translational relevance of these 
findings. 

Thus, to gain a comprehensive understanding of the effects of 
inhibiting both miR-33a and miR-33b in a model highly related to 
humans, we treated African green monkeys (Chlorocebus aethiops) 
with a 2’-fluoro/methoxyethyl (2'-F/MOE)-modified, phosphorothio- 
ate-backbone-modified, antisense miR33 (denoted anti-miR33), 
which we showed was equally effective at inhibiting both miR-33a 
and miR-33b in vitro (Supplementary Fig. 1a). Six animals per group 
were subcutaneously administered a clinically relevant dose of anti- 
miR-33 (5 mgkg ') or a mismatch control"! twice weekly for the first 
two weeks and then weekly for the remainder of the study (Fig. 1a). 
Quantification of hepatic anti-miRNA levels by ion-pairing high- 
performance liquid chromatography (HPLC) coupled to electrospray 
mass spectrometry (ES/MS) after 4 and 12 weeks of treatment showed 
equivalent delivery of anti-miR-33 and control (mismatch) oligonu- 
cleotides (Supplementary Fig. 1b). No toxicity seemed to be associated 
with the anti-miRNA treatment, as shown by the clinical chemistries, 
blood counts, coagulation markers, body weights and serum cytokine 
profiles of the monkeys (Fig. 1b and Supplementary Fig. 1c, d), which 
remained within normal limits throughout the study. 

Microarray profiling of messenger RNA obtained from liver biopsies 
after 4 weeks of treatment revealed that anti-miR-33 selectively increased 
the expression of miR-33 heptamer-matched genes in monkeys fed a 
normal chow diet (Supplementary Table 1). Of these, the gene encoding 
the cholesterol transporter ABCA1 was the most highly derepressed 
miR-33 target gene. Reverse transcription followed by quantitative 
PCR (RT-qPCR) analysis confirmed the increase in ABCA expression, 
as well as that of other known miR-33 target genes (target site alignment 
is shown in Supplementary Fig. 2), including the genes encoding two 
enzymes involved in fatty acid oxidation, CROT and HADHB, and the 
insulin signalling gene IRS2 (Fig. 1c and Supplementary Fig. 3). 

To assess the effects of miR-33 inhibition under different metabolic 
conditions, monkeys were switched after 4 weeks to a high carbohydrate, 
moderate cholesterol diet for 8 weeks, thereby totalling 12 weeks of 
treatment. After 8 weeks of the high carbohydrate, moderate cholesterol 
diet, SREBF1 mRNA levels increased by 5-fold in the control animals, 
and a corresponding 2.2-fold increase in miR-33b was observed, mak- 
ing its expression more than 7-fold higher than miR-33a (Fig. 1d and 
Supplementary Fig. 3). Microarray and RT-qPCR analysis showed that 


1Marc and Ruti Bell Vascular Biology and Disease Program, Leon H. Charney Division of Cardiology, Department of Medicine, New York University School of Medicine, New York, New York 10016, USA. 
Regulus Therapeutics, San Diego, California 92121, USA. 7Department of Pathology-Section on Lipid Sciences, Wake Forest University School of Medicine, Winston-Salem, North Carolina 27157, USA. 
*Department of Pathology-Section on Comparative Medicine, Wake Forest University School of Medicine, Winston-Salem, North Carolina 27157, USA. 


*These authors contributed equally to this work. 


404 | NATURE | VOL 478 | 20 OCTOBER 2011 


©2011 Macmillan Publishers Limited. All rights reserved 


Experimental outline 
a ° Seed b 


Anti-miR-33: T¢ CAATGCAACTACAATGCAC -e- Mismatch control 


LETTER 


Figure 1 | Silencing of miR-33a/b in non-human 
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the derepression of the above-mentioned miR-33 target genes by anti- 
miR-33 was largely sustained in monkeys fed a high carbohydrate, 
moderate cholesterol diet (Fig. 1c, Supplementary Fig. 3 and Sup- 
plementary Table 2). Under these diet conditions, we observed an 
increase in an additional miR-33 target gene involved in fatty acid oxida- 
tion, CPTIA (Fig. 1c and Supplementary Fig. 3). Although ABCGS5 and 
ATP8B1 are predicted to contain miR-33-binding sites, no difference 
in their mRNA levels was observed (Fig. 1c). Furthermore, throughout 
the study we observed no difference between the groups in the expres- 
sion of hepatic lipid metabolism genes lacking miR-33-binding sites, 
such as APOE and APOAI, or in ABCG1, which lacks the miR-33- 
binding site present in the mouse gene (Fig. 1c and Supplementary 
Fig. 3). Marked upregulation of ABCA] mRNA in anti-miR-33-treated 
monkeys was also observed in the spleen, which is a macrophage-rich 
tissue (Supplementary Fig. 1f). 

As miRNAs can affect both mRNA stability and translation, we 
measured hepatic ABCA1, CROT and CPT1A protein levels after 4 
weeks of chow diet treatment. All three of these miR-33 targets were 
present in increased amounts in the livers of monkeys that had been 
treated with anti-miR-33 compared to monkeys treated with mismatch 
control (Supplementary Fig. le). Furthermore, despite anti-miR-33 
having only modest effects on the amount of ABCA1 mRNA after 12 
weeks of treatment, hepatic ABCA1 protein levels remained robustly 
increased, as did the expression of CROT and CPT1A (Fig. le). 

Notably, although we observed no difference in SREBF2 expression 
in anti-miR-33-treated or control anti-miRNA-treated animals over 
the course of the study, we detected a 50% decrease in SREBF1 mRNA 
in the anti-miR-33-treated monkeys at 12 weeks (Fig. 1f and Sup- 
plementary Fig. 3), which was confirmed by western blotting (Fig. 1g). 
We postulated that this decrease in SREBP1 may result from the 
derepression of negative regulators of this pathway that are targeted 
by miR-33. Consistent with this hypothesis, we observed a fourfold 
increase in AMP-activated protein kinase (PRKAA1) mRNA in the 
livers of anti-miR-33-treated monkeys, whereas no change in sirtuin 
6 (SIRT6) MRNA was detected (Fig. 1h). SREBP1 has a major role in 


the transcriptional regulation of fatty acid synthesis, and measurement 
ofits downstream target genes revealed decreased mRNA levels of ATP 
citrate lyase (ACLY), acetyl-CoA carboxylase-x (ACACA) and fatty 
acid synthase (FASN) (Fig. 1f). 

As increased hepatic expression of ABCA1 would be predicted to 
augment HDL biogenesis, we measured plasma lipoprotein cholesterol 
levels. Whereas weekly blood sampling revealed no difference in total 
plasma cholesterol or LDL-associated cholesterol between treatment 
groups, there was both a significant decrease in VLDL-associated 
cholesterol and an increase in HDL-associated cholesterol in anti- 
miR-33-treated monkeys compared with mismatch-control-treated 
monkeys (Fig. 2a-d). A maximal HDL increase of 50% was reached 
after 8 weeks and was sustained throughout the remainder of the study 
(Fig. 2b). Correspondingly, lipoprotein separation by fast protein 
liquid chromatography (FPLC) showed increased cholesterol in the 
HDL fraction and a left-shifted HDL peak in anti-miR-33-treated 
monkeys compared with mismatch-control-treated monkeys, suggest- 
ing the presence of larger HDL particles (Fig. 2e). 

To characterize further the HDL, we examined the plasma concen- 
trations of apolipoprotein AI (apoAI), apoAII and apoE both in total 
plasma and in HDL fractionated by FPLC, by using enzyme-linked 
immunosorbent assays (ELISAs) and western blotting, respectively. By 
these two measures, we observed that anti-miR-33-treated monkeys 
had significant increases in the amounts of the primary apolipo- 
proteins carried on HDL—that is, apoAI and apoAII—associated with 
large and very large HDL particles compared with mismatch-control- 
treated monkeys (Fig. 3a, b and Supplementary Fig. 4). As the static 
measurement of HDL has inherent limitations in extrapolating to its 
functionality’, we examined the atheroprotective properties of anti- 
miR-33-generated HDL, namely its ability to promote cholesterol 
efflux by macrophages and to protect endothelial cells from cytokine- 
induced inflammation. Equivalent volumes of serum or polyethylene 
glycol (PEG)-isolated HDL from anti-miR-33-treated monkeys 
induced greater macrophage cholesterol efflux than did the same 
volumes from control monkeys (Fig. 3c), correlating with the plasma 
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Figure 2 | Plasma cholesterol levels in control-treated and anti-miR33- 
treated monkeys. a-d, Levels of total cholesterol (TC) (a), HDL cholesterol 
(HDL-C) (b), LDL cholesterol (LDL-C) (c) and VLDL cholesterol (VLDL-C) 
(d) in the plasma of monkeys treated with anti-miR-33 or mismatch control. *, 
P=<0.05; t+, P=0.1. e, Cholesterol content of FPLC-fractionated lipoproteins. 
Data are presented as the mean + s.e.m. 


HDL concentration in these animals. When normalized for cholesterol 
content, the PEG-isolated HDL from anti-miR-33-treated and control- 
treated monkeys showed similar acceptor capacity for cholesterol efflux 
from macrophages (data not shown). Furthermore, anti-miR-33- 
generated HDL maintained its anti-inflammatory effects on endothelial 
cells (Supplementary Fig. 4). 

Given the reciprocal effects of anti-miR-33 treatment on the expres- 
sion of genes involved in fatty acid oxidation and synthesis, we next 
measured plasma triglyceride levels. In anti-miR-33-treated monkeys, 
there was a striking reduction in plasma triglyceride levels compared 
with control-treated monkeys (Fig. 4a). This decrease was apparent as 
early as 4 weeks and reached a maximum reduction of 50% at the 
termination of the study. Fractionation of plasma lipoproteins revealed 
that this decrease derived primarily from a reduced level of VLDL 
triglycerides throughout the study and a decrease in LDL triglycerides 
at 12 weeks (Fig. 4b). VLDL particle analysis by NMR spectroscopy 
showed a decreased accumulation of large VLDL particles in anti-miR- 
33-treated monkeys compared with control-treated monkeys (Fig. 4c), 
with a corresponding decrease in apoB and apoE in the VLDL fraction 
(Fig. 4d). Thus, by simultaneously increasing fatty acid oxidation by 
derepression of HADHB, CPT1A and CROT and decreasing fatty acid 
synthesis by inhibition of the SREBP1 pathway, anti-miR-33 treatment 
leads to a pronounced reduction in plasma VLDL triglycerides. 

The development of novel therapies to exploit the atheroprotective 
properties of HDL is an area of intense investigation’. In randomized 
clinical trials, raising plasma HDL concentrations by augmenting 
apoAI levels or by treating with niacin has shown direct benefits in 
patients with coronary artery disease, including reducing cardiovascular 
event rates and plaque volume’. However, the development of HDL- 
raising drugs has proven particularly challenging’*. Previous studies by 
our group and others have shown that inhibiting mature miR-33a in 
mice is an effective strategy to raise HDL levels** and to enhance reverse 
cholesterol transport and induce the regression of atherosclerotic 
plaques®. Although promising, these studies in mice can provide only 
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Figure 3 | Characterization of HDL. a, Plasma apoAl and apoAII in monkeys 
treated with anti-miR-33 or mismatch control. *, P= 0.05. Mod chol / high 
carb, high carbohydrate, moderate cholesterol diet. b, HDL fractions obtained 
from FPLC fractionation of plasma (top) analysed by western blotting (bottom) 
for apoE, apoAI and apoAlI. L, large particles; M, medium particles; S, small 
particles; VL, very large particles. c, Macrophage cholesterol efflux into serum 
(2.5%) or PEG-isolated HDL from monkeys treated with anti-miR-33 or 
mismatch control. *, P= 0.05. Data are presented as the mean + s.e.m. 


limited translational insight because mice lack miR-33b expression, 
which may contribute substantially to miR-33 levels in humans. 

This study in non-human primates is the first to show that 
inhibiting both miR-33a and miR-33b has a profound and sustained 
effect on circulating HDL levels. Importantly, this study also estab- 
lishes that miR-33 antagonism markedly suppresses plasma VLDL 
triglyceride levels, partly as a result of regulating key genes involved 
in fatty acid oxidation”® and synthesis. Because low HDL levels and 
high VLDL triglyceride levels are commonly associated with metabolic 
syndrome’, miR-33 inhibitors may have clinical utility for the treat- 
ment of this growing health concern. Notably, as was recently reported 
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Figure 4 | Triglyceride and VLDL particle analysis. a, b, Level of total 
triglycerides (a), VLDL triglycerides, LDL triglycerides or HDL triglycerides 
(b) in the plasma of monkeys treated with anti-miR-33 or mismatch control. *, 
P=0.05; +, P=0.1. c, Quantification of the concentration of small, medium 
and large VLDL particles by NMR spectroscopy. *, P = 0.05. d, Western blot 
(top) of apoE and apoB in VLDL fractions (bottom) obtained from FPLC 
fractionation of plasma. Data are presented as the mean + s.e.m. 
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in mice’, the inhibition of miR-33 in monkeys also increased hepatic 
expression of IRS2, a key component of insulin signalling, which also 
becomes dysfunctional in individuals with metabolic syndrome’. As 
the monkeys used in this study were normoglycaemic, future studies in 
primate models of obesity and diabetes will be important to test the 
effects of miR-33 inhibition on insulin signalling. 

Taken together, our findings show that pharmacological inhibition of 
miR-33a and miR-33b leads to a sustained increase in plasma HDL 
cholesterol and a coincident decrease in VLDL triglycerides without 
any evidence of adverse effects. These findings in non-human primates 
support the development of antagonists of miR-33 as potential thera- 
peutics for dyslipidaemia, atherosclerosis and related metabolic diseases. 


METHODS SUMMARY 


All experiments were performed in accordance with National Institutes of Health 
guidelines for animal research and were approved by the Wake Forest University 
Health Science Institutional Animal Care and Use Committee. Male African green 
monkeys (n = 6 per group) were injected subcutaneously with 5mgkg ! 2'-F/ 
MOE antisense miR-33 or mismatch anti-miRNA (Regulus Therapeutics) twice 
weekly for 2 weeks and then weekly until the termination of the study. Monkeys 
were fed a weighed amount of chow or a high carbohydrate, moderate cholesterol 
semisynthetic diet. Serum and whole blood samples were analysed using 
Superchem and CBC tests (ANTECH Diagnostics), respectively. Plasma lipoprotein 
cholesterol and triglyceride distribution were determined by online gel-filtration 
HPLC. Pooled plasma was separated by FPLC on a Superose 6 10/300 GL column 
(GE Healthcare). Plasma lipoprotein particle number and size were determined 
using NMR spectroscopy (LipoScience). Plasma apoAl, apoAII and apoE levels were 
measured by ELISA. A detailed description of the RNA and protein analyses, as well 
as the HDL characterization assays, is in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


African green monkeys. Adult male African green monkeys (Chlorocebus 
aethiops) (n = 12, age 5-10 years) were obtained from St. Kitts island. Monkeys 
were housed in an AAALAC-accredited facility under the direct care of the Wake 
Forest University Health Sciences (WFUHS) Animal Resources Program and 
necropsied at the termination of the study. All experiments were approved by 
the WFUHS Institutional Animal Care and Use Committee (IACUC). Monkeys 
were singly housed in climate-controlled conditions with a 12h light and dark 
cycle. 

The monkeys were provided water ad libitum and were initially fed a weighed 
amount of a chow diet (Monkey Diet 5038, Lab Diet) twice daily, such that their 
daily caloric intake was 70 kcal day ' kg” ' body weight. To induce dyslipidaemia, 
the monkeys were fed twice daily with a weighed amount of a high carbohydrate, 
moderate cholesterol semisynthetic diet (Supplementary Table 3), which was 
prepared at the Wake Forest Primate Center Diet Laboratory, such that their 
daily caloric intake was 90 kcal day’ kg~' body weight. The use of a similar 
semisynthetic high-fructose diet in cynomolgus monkeys has been shown to 
increase plasma triglyceride levels". 

Liver biopsy. Liver samples were collected from chow-fed monkeys before and 
after 4 weeks of anti-miRNA treatment. After an overnight fast, the monkeys were 
initially anaesthetized with ketamine (10 mg kg” ' intramuscularly) and pretreated 
with atropine (0.04mgkg ' intramuscularly). After intubation, anaesthesia was 
maintained throughout the surgical procedure with isoflurane (3.5-5.0% for 
induction, and 1-2% for maintenance, by inhalation). The monkeys were shaved 
and prepared for surgery in accordance with standard veterinary medical practice 
(three sequential scrubs with Nolvasan and rinses with isopropanol, and a final 
spray with Betadine solution). Anaesthesia was monitored at 15-min intervals 
during the surgical procedure. If there was a change after the monkeys reached 
the desired plane of anaesthesia, anaesthesia was increased or decreased at the 
direction of the surgical veterinarian. Heart rate, oxygen saturation, end expired 
CO,, capillary refill time, respiration rate and temperature were recorded at 
15-min intervals or more frequently if needed. Under sterile conditions, an 
abdominal incision was made and a 1-g wedge of liver was taken. The monkeys 
were administered buprenorphine HCl (0.01 mgkg™' intramuscularly) for pre- 
emptive analgesia. The laparotomy incision was closed in a standard three-layer 
manner, with the abdominal wall closed using a synthetic absorbable suture in a 
simple interrupted pattern. The subcutis was similarly closed with a continuous 
suture pattern, using a synthetic absorbable material, and the skin was opposed 
with a continuous pattern of intradermally placed suture. The monkeys were 
returned to their cages and monitored during anaesthetic recovery in accordance 
with the WFUHS IACUC policy on survival surgery and post-surgical care. For 10 
days following surgery, the monkeys were monitored for post-operative pain, 
which was alleviated with buprenorphine HCl (0.01 mgkg ’ intramuscularly, 
twice daily) and ketoprofen (2.0-3.0 mgkg ’ intramuscularly, once daily). 
Anti-miRNA treatment. Regulus Therapeutics provided 2'-F/MOE-modified, 
phosphorothioate-backbone-modified antisense miR-33 (TGCAATGCAACTA 
CAATGCAC) and mismatch control (TCCAATCCAACTTCAATCATC) anti- 
miRNA (the mismatched bases are underlined). Monkeys were injected subcuta- 
neously with 5 mg anti-miRNA kg * body weight twice weekly during the first 2 
weeks and then once weekly during the remaining 10 weeks of the study. 
Monitoring for adverse effects. To monitor for any adverse effects of anti- 
miRNA treatment, body weights were measured weekly. In addition, at week 0, 
4, 8 and 12 of anti-miRNA treatment, analysed serum and whole blood samples 
were analysed by ANTECH Diagnostics, using the Superchem and CBC tests, 
respectively. 
RNA extraction and qPCR. Liver tissue was homogenized in 300 pl TRIzol 
(Invitrogen) with 1-mm zirconium silicate beads, using a Bullet Blender (Next 
Advance). Insoluble material and the beads were removed by centrifugation; the 
volume of TRIzol was brought up to 1 ml; and RNA was extracted according to the 
manufacturer’s protocol. RNA integrity was verified using Agilent’s Bioanalyzer. 
Reverse transcription was carried out on 1 ig total RNA using the iScript cDNA 
Synthesis Kit (Bio-Rad). qPCR was performed using the Eppendorf Mastercycler 
with primers directed against the genes listed in Supplementary Table 2. 
miR-33a and miR-33b detection was performed using a QuantiGene 2.0 miRNA 
Assay (Panomics) according to the manufacturer’s instructions. Total RNA (1-mg 
input) was used for each liver sample. Copy number quantification was done by 
generating a standard curve using miR-33a and miR-33b oligonucleotides synthe- 
sized by Integrated DNA Technologies. 
Affymetrix gene array analysis. Microarray analysis of liver biopsies from mon- 
keys treated with anti-miR-33 or a control anti-miRNA were performed in groups of 
six, using Human U133 Plus 2.0 Arrays (Affymetrix). Treatments were performed at 
three time points: 0, 4 and 12 weeks. Data from the arrays were normalized, quality 
controlled and compared using two-way analysis of variance (ANOVA) tests. Genes 


with a heptamer signature seed sequence that matched miR-33 were filtered using 
P values with an % < 0.05 and sorted by fold change. The miR-33 seed sequence was 
taken from miRBase (http://www.mirbase.org), and seed-matched genes were 
derived from genes with 3'-untranslated region (UTR) seed-sequence matches in 
the Ensembl] BioMart database (http://www.biomart.org). 

HPLC-ES/MS quantification of anti-miRNA. Anti-miRNA was quantified in 
liver samples by ion-pairing HPLC-ES/MS. Separation was accomplished using a 
1200 series HPLC-MS system (Agilent Technologies) consisting of a binary pump, 
a diode array ultraviolet detector, a column oven, an autosampler and a 6100 
Single Quadrupole MS. Typically, each sample (50mg) was extracted using a 
phenol-chloroform-isoamy] alcohol (25:24:1) extraction method followed by a 
two-step solid-phase extraction method (strong anion exchange followed by 
reverse phase C18 ). The extracted material was reconstituted in water and injected 
directly onto an XBridge OST C18 column (50 X 2.1mm, 2.5mm particles; 
Waters). The column was maintained at 55 °C, and the flow rate on the column 
was 0.1 ml min~'. The column was equilibrated with 25% acetonitrile in 5mM 
tributylammonium acetate, pH 7.0. A gradient from 30% to 60% acetonitrile over 
10 min was used to separately elute the compound of interest and the internal 
standard (a 27-amino acid 2’-F/MOE-modified compound). Peak areas were 
quantified online using single-ion monitoring mode with m/z = 1,868 for 
RG428651 and 1,843 for RG522293. Mass spectra were obtained using a drying 
gas flow rate of 121 min” ' at 350 °C, a nebulizer pressure of 35 psig and a capillary 
voltage of 4,000 V. Chromatograms were analysed using ChemStation software. 
Compound levels were back-calculated using a quadratic fit and calibration curve 
range of 15.6-500mgg | tissue. The lower limit of quantification (LLOQ) was 
equal to 31.3 mgg ' tissue. 

Protein extraction and western blotting analyses of liver. Liver tissue was 
homogenized in 500 ul RIPA buffer using a Bullet Blender, as described above. 
Lysates were cleared by centrifugation, and a total of 50 Ug protein was separated 
using SDS-PAGE and transferred to nitrocellulose or polyvinylidene difluoride. 
Membranes were blotted with antibodies against ABCA1 (rabbit; from M. 
Fitzgerald'*), CPT 1A (goat; Novus Biologicals), CROT (rabbit; Abcam) and tubulin 
(mouse; Sigma). Secondary antibodies labelled with IRDye800 or IRDye700 
(Rockland Immunochemicals for Research) were visualized using an Odyssey 
Imaging System (LI-COR). 

Cholesterol efflux assays. THP-1 cells (1x 10° per well) were differentiated in 
10nM phorbol myristate acetate for 72h in RPMI medium supplemented with 
10% FBS. Cells were loaded by incubation with 37.5 1g ml’ acetylated LDL 
(Biomedical Technologies) and labelled with 0.5 uCi ml! [?H]cholesterol 
(PerkinElmer) for 24h. Excessive label was removed by extensive washing with 
PBS before cells were equilibrated in 2 mg ml ' fatty-acid-free BSA in RPMI. To 
use as an acceptor in the efflux studies, HDL was isolated by combining pooled 
serum samples (from m = 6 monkeys treated with either mismatch control or anti- 
miR33) with 20% PEG (Sigma) followed by centrifugation, to precipitate the apoB- 
containing lipoproteins, as described previously’®. The cholesterol content of this 
PEG- isolated HDL was determined to be 526 1g ml for control anti-miRNA- 
treated monkeys and 863,1gml | for anti-miR-33-treated monkeys. Medium 
containing equal volumes (251) of this PEG-isolated HDL was added to the 
labelled cells for 6 h. Alternatively, 2.5% pooled serum from each group of monkeys 
was added to the cells as an efflux acceptor for 6 h. Supernatants were collected, and 
the *H was counted and expressed as a percentage of the total cell [*H] cholesterol 
content (total effluxed [*H]cholesterol + cell-associated [*H]cholesterol). Data are 
expressed as mean = s.d. of triplicate wells and represent an experimental n = 3. 
Luciferase assays. HEK293 cells were seeded 24h before transfection in 24-well 
plates. A plasmid containing the full-length 3’-UTR of ABCA] downstream of 
firefly luciferase (GeneCopoeia) was transfected into cells in the presence or 
absence of the following vectors: pre-miR-33a, pre-miR-33b or a control 
miRNA (System Biosciences) along with 20nM of either the mismatch anti- 
miRNA or anti-miR-33. Twenty-four hours after transfection, cells were collected, 
and the luciferase activity was measured using the Dual-Glo Luciferase Assay 
System (Promega). Firefly luciferase activity was normalized to Renilla luciferase 
activity as a control for transfection efficiency. Data are expressed relative to the 
ABCA] 3’-UTR activity in the presence of a control miRNA and are mean + s.d. 
of triplicate samples of an experimental n = 3. 

Plasma lipid, apolipoprotein and lipoprotein concentrations. Monkeys were 
sedated with ketamine (10 mgkg ' intramuscularly), and blood was collected in 
EDTA-containing Vacutainers. Plasma was isolated by centrifugation at 1,500g for 
30 min at 4°C. Total plasma cholesterol and triglyceride levels were measured 
using a Cholesterol Reagent Set (Pointe Scientific) and Triglyceride Reagent and 
Free Glycerol Reagent (Sigma), respectively. The plasma lipoprotein cholesterol 
and triglyceride distributions were determined by online gel-filtration HPLC’, 
using Infinity Cholesterol or Infinity Triglycerides reagent (Thermo). The plasma 
lipoprotein particle number and size were determined by LipoScience using NMR, 
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as described previously’’. Plasma apoAI, apoAII and apoE levels were measured by 
ELISA”. 

Plasma cholesterol and apolipoprotein distribution. An equal volume of 
plasma from each monkey within a treatment group was pooled. The pooled 
plasma was separated on a Superose 6 10/300 GL column (GE Healthcare) at a 
flow rate of 0.4ml min '. From 20 to 60 min post-injection, fractions were col- 
lected at 1-min intervals. The total cholesterol content of the fractions was deter- 
mined using a Cholesterol Reagent Set (Pointe Scientific). An equal volume of each 
fraction was mixed with 5X SDS sample buffer and separated on a Novex 
NuPAGE 4-12% Bis-Tris Midi Gel using 1 NuPAGE MOPS SDS running buffer 
(Invitrogen). The proteins were transferred to a nitrocellulose membrane, which 
was subsequently blocked with 5% (w/v) non-fat dried milk dissolved in wash 
buffer. The membrane was incubated overnight at 4°C in one or more of the 
following goat anti-monkey affinity-purified antibodies: anti-apoAI, anti-apoAII, 
anti-apoB or anti-apoE. All anti-monkey apolipoprotein antibodies were created 
and tested for specificity at WFUHS. Secondary antibodies against goat IgG were 
conjugated to horseradish peroxidase (Sigma), and proteins were visualized with 
ECL reagent (PerkinElmer) and exposure to Blue X-Ray Film (Phenix). 

Plasma cytokine analysis. The levels of cytokines in plasma samples were measured 
using a Human Proinflammatory 7-Plex Assay Ultra-Sensitive Kit (Meso Scale 
Discovery). The assay was done according to the manufacturer’s assay procedure. 
Briefly, 25 tl monkey plasma samples or kit calibrators were added to assay diluent in 
a 96-well 7-plex assay plate. After incubation for 2h, the plate was washed then 
incubated with detection antibody solution for 2h. Read Buffer was added to each 
well in the assay plate, and the signal was analysed by a SECTOR Imager (Meso Scale 
Discovery). Seven cytokines, including interleukin-1 (IL-1), IL-12p70, interferon-y, 
IL-6, IL-8, IL-10 and tumour-necrosis factor-c, were analysed simultaneously by the 
imager. The data were quantified using a standard curve generated from the kit 
calibrators for each of the cytokines. 

Flow cytometric analysis of endothelial VCAMI1 and E-selectin. Human 
umbilical vein endothelial cells (HUVECs; Lonza) were cultured in EGM-2 
according to the manufacturer's protocol. HUVECs (passage 3-5) were plated 
in 24-well plates, with 1 X 10° cells in 400 pl EGM-2 containing 20% FBS per well. 
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After a 5-h re-attachment period, cells were pre-incubated for 16h with HDL 
isolated from pooled serum samples (n = 6), after which TNF-« (0.5 ng ml}; 
PeproTech) was added to the culture medium for an additional 4h. Cell-surface 
expression of VCAM1 and E-selectin was then measured by flow cytometry, using 
an anti-VCAM1 monoclonal antibody (BD Biosciences) followed by an anti- 
mouse fluorescein isothiocyanate (FITC)-conjugated antibody (Sigma) or an 
anti-E-selectin phycoerythrin (PE)-conjugated antibody (Chemicon). Cells were 
detached using 5 mM EDTA in PBS. Cellular FITC and PE were then analysed by 
flow cytometry (with an Accuri Cé Flow Cytometer; BD Biosciences). Controls 
included an isotype-matched control antibody and no primary antibody. Native 
HDL, (isolated by sequential ultracentrifugation of human plasma) at a concen- 
tration of 0.5 mg ml ' protein was used as a positive control. 

Statistical analyses. All comparisons were made using Student's t-test (P = 0.05), 
and data are expressed as mean + s.e.m., unless otherwise noted. Data from the 
arrays were normalized, quality controlled and compared using two-way ANOVA 
tests. 
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Crystal structure of a bacterial homologue of the bile 
acid sodium symporter ASBT 


Nien-Jen Hu’?*, So Iwata’??*°, Alexander D. Cameron’??* & David Drew! 


High cholesterol levels greatly increase the risk of cardiovascular 
disease. About 50 per cent of cholesterol is eliminated from the 
body by its conversion into bile acids. However, bile acids released 
from the bile duct are constantly recycled, being reabsorbed in the 
intestine by the apical sodium-dependent bile acid transporter 
(ASBT, also known as SLC10A2). It has been shown in animal 
models that plasma cholesterol levels are considerably lowered by 
specific inhibitors of ASBT’”, and ASBT is thus a target for hyper- 
cholesterolaemia drugs. Here we report the crystal structure of 
a bacterial homologue of ASBT from Neisseria meningitidis 
(ASBTym) at 2.2 A. ASBTym contains two inverted structural 
repeats of five transmembrane helices. A core domain of six helices 
harbours two sodium ions, and the remaining four helices pack in a 
row to form a flat, ‘panel’-like domain. Overall, the architecture of 
the protein is remarkably similar to the sodium/proton antiporter 
NhaA’, despite having no detectable sequence homology. The 
ASBT ym Structure was captured with the substrate taurocholate 
present, bound between the core and panel domains in a large, 
inward-facing, hydrophobic cavity. Residues near this cavity have 
been shown to affect the binding of specific inhibitors of human 
ASBT". The position of the taurocholate molecule, together with 
the molecular architecture, suggests the rudiments of a possible 
transport mechanism. 

ASBT is an SLC10 (sodium bile acid co-transporter family) member 
that moves bile acids across the apical membrane of the ileum into the 
portal blood vein*®. ASBT uses the sodium ion gradient to drive the 
‘uphill’ transport of bile acids across membranes, with a reported 
stoichiometry of two sodium ions per substrate’. Mutations in the 
human ASBT gene cause a condition of primary bile acid malabsorp- 
tion®. ASBT is a pharmaceutical target for drugs aimed at lowering 
cholesterol, and several ASBT inhibitors have been developed that are 
effective in animal models'”. Because some drugs are poorly absorbed 
in the intestine or need to be targeted to the liver, ASBT and its close 
liver paralogue, NICP (SLC10A1), have also received attention as 
prodrug carriers, capable of transporting various compounds coupled 
to bile acid, for example HMG-CoA reductase (HMGCR) inhibitors, 
the antiviral drug acyclovir, nucleotides and cytostatic drugs’. 

ASBT xm from N. meningitidis, with 26% identity and 54% similarity 
to human ASBT, was identified by fluorescence-based screening 
methods’*” as a suitable candidate for structural studies (Supplemen- 
tary Figs 1 and 2). Residues known to be functionally important in 
mammalian ASBT and other SLC10 members” are well conserved in 
ASBTym (Supplementary Fig. 1). Bile acid transport by ASBT yyy was 
confirmed in whole cells by the sodium-dependent uptake of [*H]- 
taurocholate (Fig. 1a). The observed Michaelis constant, K,,, for PH]- 
taurocholate is ~50 IM (Fig. 1b), a value similar to that measured for 
rat and human ASBT”**, The ASBT inhibitors cyclosporin A’ and 
bromosulfophthalein’”’ and the drug fluvastatin’® are also competitors 
for ASBTyy-mediated [*H]-taurocholate transport (Fig. 1c). Thus, 
ASBTym is a valid model of mammalian bile acid transporters. We 


solved the structure of ASBT\yy by single-wavelength anomalous scat- 
tering and refined it at a resolution of 2.2 A (Supplementary Tables 1 
and 2; Methods). 

ASBTym has cytoplasmic amino and carboxy termini, comprises 
ten transmembrane helices (TMs) that are linked by short loops, and 
has overall dimensions of approximately 45 A x 30 A X 30A (Fig. 2 
and Supplementary Fig. 3). TM1 to TM5 and TM6 to TM10 are 
topologically similar but oppositely orientated in the plane of the 
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Figure 1 | Sodium-dependent transport of bile acid by ASBT yw. a, Time- 
dependent uptake of [?H]-taurocholate after expression of ASBT yyy in 
Escherichia coli, as monitored in buffer containing 137 mM sodium (filled 
circles) or <1 mM sodium (open circles) b, Michaelis-Menten transport 
kinetics of ASBTy-mediated [°H]-taurocholate uptake. The specific uptake 
(filled circles) was calculated by subtracting the internalization measured from 
control cells lacking the transporter (open squares) from the total uptake (open 
circles), as detailed in Methods. c, ASBTyy4-mediated [*H]-taurocholate uptake 
after 5 min in the presence of 150 [1M taurocholate, cyclosporin A, fluvastatin or 
bromosulfophthalein (black bars), measured as a percentage of the uptake 
without their addition (white bar). d, ASBT\,,-mediated ["H]-taurocholate 
uptake after 5 min for a wild-type control (white bar) and the indicated single 
alanine point mutants (black bars). The uptake for the mutants is displayed as a 
percentage of the wild-type activity. The expression and detergent-solubilized 
folded state of each mutant was similar to wild-type protein (Supplementary 
Fig. 2a). Errors bars, s.e.m.; n = 3. 
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Figure 2 | ASBT yy structure. a, Ribbon representation of ASBT yy as viewed 
in the plane of the membrane. TM1 to TM10 have been coloured from red at 
the N terminus to blue at the C terminus, and the position of the membrane is 
depicted in grey. The pink spheres indicate sodium sites Nal and Na2, and the 
stick model represents the substrate taurocholate. b, ASBT yyy structure as 
viewed from the intracellular side as a ribbon representation (left) and as a 
simplified cartoon (right). On the right, the burgundy pentagon represents 
taurocholate. 


membrane. The root mean squared deviation (r.m.s.d.) after super- 
position of the two topology-inverted repeats is 3.7 A (Supplementary 
Fig. 4a, b; Methods). Each repeating unit is made of an N-terminal 
V-motif (TM1 and TM2, TM6 and TM7) and a core motif of three 
helices (TM3 to TM5, TM8 to TM10) (Fig. 2 and Supplementary Figs 3 
and 4). Ifthe V-motif and the core motif are superposed separately, the 
r.m.s.d. in each case is lower: 2.6A and 2.8 A, respectively (Sup- 
plementary Fig. 4c). The core motifs from each repeat form the core 
domain, whereas the two V-motifs create a panel-like domain 
(Fig. 2b). TM4 and TM9 in the core domain are broken in the middle 
(discontinuous) and form helical hairpins with TM5 and TM10, 
respectively, which are both kinked. At the point where TM4 and 
TM9 are broken by well-conserved peptide motifs, they cross over 
(Fig. 2 and Supplementary Figs 5 and 6). On the intracellular side, a 
wide crevice separates the core domain from the panel domain 
(Fig. 3a). The cavity extends over halfway through the protein. The 
extracellular side of the cavity is tightly closed by TM1, TM2, TM4b, 
TM7, TM9b and TM10. Previously, two topology models of ASBT 
were proposed with seven and nine transmembrane helices, respec- 
tively’”"*. Because TM] is not conserved in ASBT, the structure is 
broadly consistent with the model with nine transmembrane helices 
(Supplementary Fig. 5). TM4 and TM9 were annotated as extracellular 
loops in the topology model with seven transmembrane helices, but 
were correctly identified in the model with nine transmembrane helices. 

Discontinuous transmembrane helices are a common motif in 
secondary active transporters*’’°. However, the sodium/proton anti- 
porter NhaA is the only other known example in which these helices 
cross as observed in ASBTym (Supplementary Fig. 6). Indeed, 
ASBTym has a similar structure to NhaA, and they superpose with 
an ram.s.d. of 2.9A over 202 Co atoms (Supplementary Fig. 7a; 
Methods). The similarity is more striking when the core and panel 
domains are superposed separately (Supplementary Fig. 7b). This 
unexpected finding further emphasizes the remarkable plasticity of 
transporters, which allows them to use a common scaffold to translo- 
cate different substrates”. 
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Figure 3 | ASBT yy structure is inward facing and contains bound sodium 
and bile acid. a, Surface representation showing the location of the 
intracellular cavity in which taurocholate binds, as a section through the 
protein. b, The sodium-binding sites in ASBT yy. Nal is octahedrally 
coordinated by Ser 114 and Asn 115 on TM4b, Thr 132 and Ser 128 on TMS, 
and Glu 260 on TM9a. The square pyramidal arrangement of the Na2 ligands is 
made up of Glu 260, Val 261, Met 263 and Gln 264 on TM9 and Gln 77 on TM3. 
c, The intracellular cavity in ASBT ym. Residues lining the cavity and near to the 
taurocholate are shown. The figures have been coloured as in Fig. 2. A 150-fold 
difference in inhibition of the mouse and human forms of ASBT by 
benzothiazepines* has been ascribed to sequence differences corresponding to 
Ser 291 at the bottom of the cavity. Supplementary Fig. 10 shows stereo versions 
of b and c. 


In ASBT and NTCP, two sodium ions are translocated per bile acid 
molecule”*’. In the highly conserved core domain of ASBTym 
(Supplementary Fig. 8), we have identified two sodium-binding sites 
(Nal and Na2) on the basis of the coordination and bond distances 
(2.0-2.5 A) (Fig. 3b and Supplementary Figs 9a and 10a; Methods). 
Nal is located approximately 10A from the cytoplasmic surface 
between TM4b and TM5, but also interacts with the carboxylate moiety 
of Glu 260 on TM9a (Fig. 3b and Supplementary Fig. 10a). The Na2 site 
is located 8 A from Nal, near the centre at the crossover points of TM4a 
with TM4b and TM9a with TM9b. Four backbone carbonyl oxygen 
atoms coordinate Na2, including Glu 260 on TM9a and the side chains 
of Gln 264 on TM9a and Gln 77 on TM3. The residues for which the 
side chains interact with the two sodium ions are completely conserved 
in ASBT and NTCP (Supplementary Figs 5 and 8). The glutamate 
residue equivalent to Glu 260 is essential for activity in ASBT and 
NTCP”. In ASBTym; its replacement with alanine significantly 
affects transport, as does the mutation of Gln 77 to alanine (Fig. 1d 
and Supplementary Fig. 2a). Thus, it seems that both sodium ions are 
required for efficient transport. Mechanistically, it is almost certainly 
necessary for sodium to be present at the Na2 site to neutralize the 
partial negative dipole of TM9a and, by doing so, stabilize the 
interaction with TM4a. Neutralization of the helix dipoles seems a 
conserved feature for this fold. In NhaA, the corresponding trans- 
membrane helix is thought to be neutralized by the positive charge 
of Lys 300, which is essential for transport*”’. 

The substrate-binding cavity is open to the cytoplasm and is 
approximately 6A X12AX 14A in size, with a solvent-accessible 
volume of 550 A? (Fig. 3a; Methods). Because the N-terminal half of 
TM1 is markedly bent outwards, it is more open on one side. The 
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cavity is much bigger than taurocholate, perhaps reflecting the large 
variety of compounds that are recognized by ASBT”"*"® (Fig. 3a, c). It is 
predominantly hydrophobic, but near the bottom there are a number 
of polar residues and water molecules (Fig. 3c and Supplementary 
Fig. 10b). As judged from high B-factors, taurocholate seems to be 
weakly bound (Supplementary Table 2 and Supplementary Fig. 9b). 
Consistent with this observation, there is only one direct hydrogen 
bond between ASBTy» and taurocholate, from Asn 295 on TM10 to 
the 7a hydroxyl group. The mutation of Asn 295 to alanine causes a 
dramatic reduction in taurocholate transport (Fig. 1d and Supplemen- 
tary Fig. 2a). Water molecules bridge the 7 hydroxyl with His 294 and 
the 3a hydroxyl with Asn 265, which is located at the crossover region 
of TM9. Thr 112 is also in the vicinity of the 3x group but cannot be 
unambiguously placed. The 12« hydroxyl group does not have any 
apparent hydrogen-bonding partner. The taurine moiety binds 
between TM1 and TM10. Interaction of the taurocholate with residues 
in TM10 is in agreement with biochemical data, which indicate that the 
last helix in ASBT has a dominant role in the translocation process™*. 
The location of Asn 265 between the TM4b and TM9b dipoles suggests 
that it may have a role in the mechanism. The importance of this 
residue has been inferred from mutagenesis studies on NTCP”. If 
Asn 265 is replaced by alanine in ASBTy™, transporter activity is 
reduced by ~80% (Fig. 1d and Supplementary Fig. 2a). Although there 
are clear similarities between the binding sites in ASBT\yy and ASBT, 
there are also sequence differences (Supplementary Fig. 5). Such dif- 
ferences may affect substrate specificity. 

For transport to take place, the protein must switch between out- 
ward- and inward-facing states*’. The architecture of ASBT yy pro- 
vides a clue to understanding how this might occur. The sodium ions 
are located in the core domain, close to the crossover points of the 
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Figure 4 | Putative mechanism for ASBT ym transport. a, Superposition of 
the ASBTyy structure (red, panel domain; blue, core domain) and the 
outward-facing model as described in the text (light grey). The superposition 
has been optimized on the core domains. Loops have been removed for clarity. 
In the right-hand image, the panel domain of the model has been rotated by 25° 
relative to the core domain, around the axis shown in the left-hand image, to 
superimpose the panel domains. Significant kinks in the helices are represented 
as breaks. The area of the cavity is depicted by a tan-coloured trapezoid. 
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discontinuous helices and occluded from the bulk solvent. In NhaA, 
sodium binding causes a rearrangement of these helices**’”. In 
ASBTyxm, similar rearrangements in the core domain are therefore 
likely. Because NhaA translocates only protons and sodium ions” 
these helix movements might be sufficient for transport. However, 
ASBTym™ transports much larger substrates, and structural movements 
in more than the core domain are therefore needed. For the sodium- 
coupled transporter LeuT, the internal asymmetry of the repeating 
motifs has been used to predict global movements from a single 
structure*’, and these movements have been substantiated by crystal- 
lographic studies”. In an analogous manner to LeuT, an outward- 
facing model of ASBT\m was generated by superimposing TM1 to 
TM5 on TM6 to TM10, and vice versa (Fig. 4a; Methods). Comparing 
the inward-facing ASBT yy, structure with the outward-facing model, 
the largest difference is the position of the panel domain relative to the 
core domain (Fig. 4a, c). A route through the protein between these 
domains is in agreement with experimental data, which suggest that 
the final helix of ASBT and TM9 (IX) of NhaA line the transport 
pathway***?°°, Notably, the NhaA domain equivalent to the panel 
domain is located between that of the outward-facing and inward- 
facing ASBTy states (Fig. 4b). This may be because NhaA trans- 
locates a much smaller substrate, or it could represent another 
conformation of the transporter, probably an occluded state. 

We propose that sodium binding controls the conformation of the 
core domain of ASBTy™, which in turn drives the movement of the 
panel domain. This large conformational change of the panel domain 
relative to the core domain is required to alter the accessibility of 
the substrate-binding pocket. The ASBTy structure should aid 
the design of new inhibitors against ASBT with the goal of treating 
hypercholesterolaemia. 


XIc 
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Transmembrane helices are numbered from 1 to 10; those coloured grey and 
labelled with an asterisk represent the helices in the outward-facing model. 

b, NhaA shown in the same view as ASBT yy in a. The core domain is shown in 
light blue and the panel domain is shown in maroon. The two additional 
transmembrane helices and {-strands that are not present in ASBT yy are 
shown in grey. The position where sodium is thought to bind’ is shown with a 
black ring. c, Proposed transport mechanism, illustrating the movement of the 
panel domain relative to the core domain to transport sodium and bile acid. 
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METHODS SUMMARY 


ASBTym was cloned into a cleavable green fluorescent protein (GFP)/Hisg fusion 
vector, pWaldoGFPe”’. The fusion protein was expressed in E. coli, solubilized in 
1% dodecyl-f-p-maltopyranoside and purified to homogeneity. Before crystalliza- 
tion, untagged ASBTyy, was exchanged into 0.06% n-dodecyl-N,N-dimthylamine- 
N-oxide by size-exclusion chromatography. Crystals were grown in the presence of 
10mM taurocholate by the vapour diffusion method. Data were collected on 
beamlines 102 and 103 at the Diamond Light Source, UK, dehydration of the 
crystals being necessary to collect high-resolution data. The protein was deriva- 
tized by short-soaking a surface-engineered cysteine mutant (ASBTy™_1) with 
1mM mercury acetate. The structure of ASBT\yw1 was solved by single- 
wavelength anomalous dispersion and subsequently refined against data collected 
from ASBT ym at a resolution of 2.2 A. The cell-based bile acid uptake assay for 
ASBTym was modified from that previously described®. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 
ASBTym sequence. MNILSKISSFIGKTFSLWAALFAAAAFFAPDTFKWAGPY 
IPWLLGIIMFGMGLTLKPSDFDILFKHPKV VIIGVIAQFAIMPATAWLLSKLL 
NLPAEIAVGVILVGCCPGGTASNVMTYLARGNVALSVAVTSVSTLISPLLTP 
AIFLMLAGEMLEIQAAGMLMSIVKMVLLPIVLGLIVHKVLGSKTEKLTDALP 
LVSVAAIVLITIGAV VGASKGKIMESGLLIFA V VVLHNGIGYLLGFFAAKWTG 
LPYDAQKTLTIEVGMQNSGLAAALAAAHFAAAPVVAVPGALFSVWHNISG 
SLLATYWAAKAGKHKKPGSENLYFQ 
ASBTyxm_1 Sequence for structure solution. MVAASMNILSKISSFIGKTFSLW 
AALFAAAAFFAPDTFKWAGPYIPWLLGIIMFGMGLTLKPSDFDILFKHPKV 
VIUGVIAQFAIMPATAWCLSKLLNLPAEIA VGVILVGCCPGGTASNVMTYL 
ARGNVALSVAVTSVSTLTSPLLTPAIFLMLAGEMLEIQAAGMLMSIVKMVL 
LPIVLGLIVHKVLGSKTEKLTDALPLVSVAAIVLIIGA VVGASKGKIMESGLL 
IFAVVVLHNGIGYLLGFFAAKWTGLPYDAQKALTIEVGMQNSGLAAALAA 
AHFAAAPVVAVPGALFSV WHNISGSLLATYWAAKAGKHKKPLDRAGSEN 
LYFQ 
Expression screening, mutagenesis and protein purification. Bacterial ASBT 
homologues were cloned as GFP-Hisg fusions into the vector pWaldoGFPe"®, as 
fluorescence from a C-terminal GFP fusion is a reliable reporter of membrane- 
integrated expression’. Fusions were overexpressed in E. coli C43(DE3) cells by 
the addition of 0.4mM IPTG at Agoonm 0.4. The temperature was decreased to 
25°C for overnight induction. The monodispersity of expressed fusions were 
screened in crude DDM., decyl-B-b-maltopyranoside-, nonyl-B-b-maltopyranoside-, 
LDAO- or dodecyl nonaethylene glycol ether (C,,E,)-solubilized membranes by 
fluorescence-detection size-exclusion chromatography (FSEC). The ASBTym 
homologue from N. meningitidis (MC58) was selected for structural studies on 
the basis of the amount of protein produced, as judged by whole-cell*' and in-gel 
fluorescence’ and the quality of the FSEC trace in different detergents. Site-directed 
mutants of ASBTyy were generated by PCR (Quickchange, Agilent Technologies). 
Wild-type ASBTxm and mutants were purified essentially as previously 
described**. In brief, membranes were isolated from 10-1 E. coli cultures and 
solubilized in 1% DDM for 2h in buffer containing <1 PBS, 150 mM NaCl and 
10 mM imidazole. The suspension was cleared by ultracentrifugation at 120,000g 
for 1h. The sample was mixed with 1 ml of Ni-NTA Superflow resin (Qiagen) per 
1 mg of GFP-Hisg and incubated for 2h at 4 °C. Slurry was loaded onto a glass 
Econo-Column (Bio-Rad) and washed in X1 PBS buffer containing 0.1% DDM, 
150 mM NaCl and 20 mM imidazole for 20 column volumes. Bound material was 
washed for a further 20 column volumes in the same buffer containing 50 mM 
imidazole. The ASBT\yj—GFP-Hissg fusion was eluted in two column volumes of 
the same buffer containing 250 mM imidazole. The eluted protein was dialysed 
overnight in the presence of stoichiometric amounts of Hisg-tagged tobacco etch 
virus protease in 31 of buffer containing 20 mM Tris-HCl, pH 7.5, 150 mM NaCl 
and 0.03% DDM. Dialysed sample was passed through a 5-ml Ni-NTA His-Trap 
column (GE Healthcare), and the flow-through containing ASBT yu was collected. 
Protein was concentrated to 10mgml~' using concentrators with a relative 
molecular mass cut-off of 100K, and was loaded onto a Superdex 200 10/300 gel 
filtration column (GE Healthcare) equilibrated in 20mM Tris-HCl, pH 7.5, 
150mM NaCl and 0.06% LDAO. The choice of the detergent LDAO was con- 
sidered suitable for crystallization by comparing FSEC® and stability data® for 
ASBTym with membrane proteins known to crystallize in this detergent’’. The 
protein peak was collected and concentrated to 20 mg ml ' for crystallization. 
Transport time course. E. coli cells harbouring wild-type ASBTN—GFP-Hisg 
were collected and resuspended in uptake buffer consisting of 1 mM CaCl,, 1 mM 
MgCl, 10mM Tris-HCl, pH 7.5, and either 137mM NaCl (Na*-containing 
buffer) or 137mM choline chloride (Na* -low buffer). Cells were incubated at 
37°C with uptake buffer containing 411M taurocholate supplemented with 
0.16 uM [2,4-°H]-taurocholate (30Ci mmol !; American Radiolabelled 
Chemicals) for the indicated time intervals. Transport was terminated by the 
addition of ice-cold buffer containing 1mM CaCl, 1mM MgCh, 10mM Tris- 
HCl, pH 7.5, 137 mM NaCl or 137 mM ChCl, and 1 mM taurocholate, and was 
followed immediately by centrifugation at 20,500g for 60s. Cell pellets were 
washed several times in an equal volume of termination buffer and resuspended 
in 200 ul of the same buffer. The radioactivity corresponding to the internalized 
substrate was measured by scintillation counting. Each experiment was performed 
in triplicate. Nonspecific uptake was assessed by repeating the time course in 
triplicate for cells transformed with the same vector but expressing the sodium/ 
proton antiporter fusion NhaA—GFP-Hiss. In all experiments, ASBT yyy expres- 
sion was calculated on the basis of GFP fluorescence measured at 510 nm (excita- 
tion wavelength, 488nm) using a 96-well spectrofluorometer*'. In-gel 
fluorescence and FSEC data of DDM-solubilized whole cells of wild-type 
ASBTym and mutants were carried out essentially as described previously’®”’. 
Transport kinetics. The accumulation of taurocholate was linear within the first 
120s. For kinetic characterization, the initial velocity of taurocholate uptake at 


37°C was measured after 120s at the indicated increasing substrate concentra- 
tions. The radioactivity corresponding to the internalized substrate was measured 
by scintillation counting. Nonspecific uptake was measured by repeating the 
transport kinetics for cells transformed with the same vector, but expressing the 
sodium/proton antiporter fusion NhaA-GFP-Hiss. Specific ASBT\m uptake was 
calculated by the subtraction of nonspecific uptake from total uptake. Each experi- 
ment was performed in triplicate. The data were fitted to the Michaelis-Menten 
equation by nonlinear regression using the GRAPHPAD PRISM software. 
Activity of ASBTyy mutants. £. coli cells harbouring ASBT\—-GFP-Hisg 
mutants were resuspended in uptake buffer containing 4 1M taurocholate supple- 
mented with 0.16 1M [2,4-*H]-taurocholate (30 Cimmol '; American Radiolabelled 
Chemicals) for 5 min at 37 °C. The radioactivity corresponding to the internalized 
substrate was measured by scintillation counting. For each mutant, the uptake 
values were corrected for background by subtracting values from parallel assays 
carried out in the absence of sodium. Activities were plotted as percentages of the 
wild-type transport activity calculated in the same way. Each experiment was 
performed in triplicate. 

Substrate specificity. The whole-cell [*H]-taurocholate uptake assay was carried 
out similarly to that described for ASBT ym mutants, except that 150 1M of either 
taurocholate (Sigma), cyclosporin A (Sigma), bromosulfophthalein (Sigma) or 
fluvastatin (Cayman Europe) was added to the uptake buffer. 

Crystallization and preliminary screening. Crystals were grown at 20 °C using 
the vapour diffusion method. Taurocholic acid (Sigma) was added to the protein 
solution to a final concentration of 10 mM. The protein was then mixed 1:1 with 
reservoir solution containing 50mM sodium citrate, pH 4.5, 70mM NaCl and 
22-24% PEG 400. Crystals appeared overnight and reached a maximum size 
after 3-4 days. The crystals were frozen in liquid nitrogen and screened using 
synchrotron radiation at the European Synchrotron Radiation Facility and 
Diamond Light Source. Crystals are tetragonal with cell dimensions of approxi- 
mately 75 A X 75 A X 180A. The best of these crystals diffract to around 2.8- 
3.5 A; however, with dehydration the diffraction increases to ~2 A. 

Structure determination of a cysteine mutant of ASBT yy. As initial attempts at 
making heavy-atom derivatives with mercury compounds failed, Leu 87 was 
modified to cysteine (construct ASBTy,_ 1). The ASBT ym , protein crystallized 
similarly to the wild-type protein. Mercury-derivatized crystals were obtained 
from this mutant by incubating for 1h with 1 mM mercury acetate before crys- 
tallization. A single mercury-derivatized crystal of ASBT\m_; was used to solve the 
structure by single-wavelength anomalous dispersion. The crystal was frozen in 
liquid nitrogen and then re-annealed before data collection by leaving it in air for 
approximately 3 min. The re-annealing resulted in shrinkage of the unit cell and an 
increase in the resolution to 2.2A. Data were collected at the mercury edge 
(1.0060 A) on beamline 103 at the Diamond Light Source. 

Data were initially processed to 2.5 A by the XIA2* pipeline to XDS” set up on 
the beamline, with further processing using the CCP4 suite of programs”. The 
space group was determined to be P4,22, with one molecule in the asymmetric 
unit. An anomalous difference Patterson map showed clear peaks associated with 
one bound heavy atom. The heavy-atom coordinates were determined using 
RSPS”. Its position was refined and phases were calculated using SHARP“ with 
solvent flattening in SOLOMON”. The resulting phases were input to the auto- 
matic structure building implemented in PHENIX”. This resulted in a model that 
was reasonably complete. Modification and further building of the structure was 
carried out in O* and COOT™. At this point, the data were reprocessed using 
MOSFLM“*, extending the resolution to 2.2 A as judged from the scaling statistics 
(Supplementary Table 1) and the features in the resulting maps. 

Structural refinement was performed in BUSTER” using individual isotropic 
B-factor refinement and TLS”. The complete protein was chosen as a single TLS 
group because no significant drop in the Rfee value was observed when splitting 
the protein into multiple groups. 

Two ions were identified in the core of the protein. The residues coordinating 
these ions and the associated distances are consistent with their being sodium 
ions**. As an additional verification, the putative sodium ions were changed to 
water molecules and run through the program WASP”, which uses valence cal- 
culations to identify possible metal ions. After replacing all solvent and ions by 
water molecules as required by the program, only the two solvent molecules 
originally assigned as sodium ions were flagged as likely sodium ions. After all 
residues had been modelled, clear electron density remained in the cavity of the 
protein. This density was enhanced in a simulated-annealing omit map calculated 
in PHENIX”. The taurocholate structure, downloaded from the Cambridge 
Structural Database (accession code, KORZUM), clearly fitted the density with 
the cholate head group positioned at the bottom of the cavity (Supplementary 
Fig. 9b). A further taurocholate was observed in the crystal interface. The final 
model has an R-factor of 19.7% and a corresponding Rgee value of 22.9%, and 
contains all protein residues from 2 to 309, two sodium ions, one mercury atom, 
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two taurocholate molecules, 37 water molecules, five LDAO molecules and two 
truncated phospholipids (phosphatidylethanolamine). The final refinement statistics 
of this model, which was used to solve the wild-type protein, are summarized in 
Supplementary Table 2. 

Structure determination and refinement of ASBT ym. Because the re-annealing 
of the ASBTym 1 construct in air was not reproducible, dehydration was 
attempted using the humidity controller HC1 device*? mounted on beamline 
102 at the Diamond Light Source. By placing the crystal into an air stream at 
45% relative humidity for 5 min before freezing it, crystals were found to repro- 
ducibly diffract to ~2.0 A. Data were collected from a single crystal of ASBT yy, on 
beamline 102 at the Diamond Light Source. The data were processed in XDS*” 
using the XIA2 pipeline*’ and scaled at a resolution of 2.2 A (Supplementary Table 1). 
The structure was refined, as above, starting from the final model of the ASBT yy 1 
construct, minus all non-protein residues. No appreciable differences were observed 
in the wild-type and mercury-derivatized structures. Taurocholate and detergent 
molecules were modelled in the same positions as for ASBT ym, 1. The final model has 
an R-factor of 21.2% and an Ree value of 24.4% (Supplementary Table 2). 
Structural analysis. Superpositions were carried out in LSQMAN”". The super- 
positions were performed so that only Co: pairs that were less than 3.8 A apart were 
included in the calculation. The numbers quoted in the text regarding the 
topology-inverted repeats of ASBTym were calculated between pairs of Co atoms 
that were less than 10 A apart. This was considered necessary to include atoms 
from both the V-motif and the core motif. In comparing ASBTym with NhaA 
(Protein Data Bank ID, 1ZCD), only pairs of atoms less than 5A apart after 
superposition were chosen, giving an r.m.s.d. of 2.9 A for 202 out of a possible 
308 pairs of Cx atoms. The volume of the cavity was calculated in VOIDOO” using 
a probe radius of 1.4 A. Figures showing the structure were drawn using PYMOL™ 
except those showing electron density, which were made using CCP4MG™. 
Outward-facing model. As in LeuT”, in ASBT yy, the protein is made up of two 
five-transmembrane-helix repeats that when superimposed show a small rotation 
of two transmembrane helices with respect to the other three (Supplementary 
Fig. 4). For LeuT, it was shown that by swapping the conformations of the 
N- and C-terminal topology-inverted repeats the structure can be changed from 
an outward-facing state to an inward-facing state’*. In ASBT yy the lengths of the 
two topology-inverted repeats are very similar. To create an outward-facing back- 
bone model of ASBTy in an analogous manner to that carried out for LeuT, 
TM1 to TMS5 were superposed on TM6 to TM10, and vice versa. 
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Extremophilic organisms require specialized enzymes for their 
exotic metabolisms. Acid-loving thermophilic Archaea that live 
in the mudpots of volcanic solfataras obtain their energy from 
reduced sulphur compounds such as hydrogen sulphide (H,S) 
and carbon disulphide (CS,)'”. The oxidation of these compounds 
into sulphuric acid creates the extremely acidic environment that 
characterizes solfataras. The hyperthermophilic Acidianus strain 
A1-3, which was isolated from the fumarolic, ancient sauna build- 
ing at the Solfatara volcano (Naples, Italy), was shown to rapidly 
convert CS, into H,S and carbon dioxide (CO,), but nothing has 
been known about the modes of action and the evolution of the 
enzyme(s) involved. Here we describe the structure, the proposed 
mechanism and evolution of a CS, hydrolase from Acidianus A1-3. 
The enzyme monomer displays a typical B-carbonic anhydrase fold 
and active site, yet CO, is not one of its substrates. Owing to large 
carboxy- and amino-terminal arms, an unusual hexadecameric 
catenane oligomer has evolved. This structure results in the block- 
ing of the entrance to the active site that is found in canonical 
B-carbonic anhydrases and the formation of a single 15-A-long, 
highly hydrophobic tunnel that functions as a specificity filter. 
The tunnel determines the enzyme’s substrate specificity for CS2, 
which is hydrophobic. The transposon sequences that surround the 
gene encoding this CS, hydrolase point to horizontal gene transfer 
as a mechanism for its acquisition during evolution. Our results 
show how the ancient B-carbonic anhydrase, which is central to 
global carbon metabolism, was transformed by divergent evolution 
into a crucial enzyme in CS, metabolism. 

Acidianus species belong to the order Sulfolobales, whose members 
are common inhabitants of solfataric regions worldwide, and these 
species are good models of evolution owing to the geographic separation 
of their habitats’. The hyperthermophilic archaeon Acidianus A1-3 (refs 
4, 5) was isolated from biofilm material from the fumarolic, ancient 
sauna building at the Solfatara volcano (Naples, Italy) on a mineral 
medium supplemented with CS,. It is the first example ofa thermophilic 
CS,-converting archaeon. Under anaerobic or aerobic conditions, cell 
extracts of Acidianus A1-3 converted CS, to H,S, with COS as the 
intermediate: CS, + H,O—> COS + H,S and COS + H,O-— CO, + H,S. 
The enzyme responsible was therefore likely to be a hydrolase, as has 
been reported for a mesophilic CS,-converting Thiomonas species®, 
rather than an oxygenase’. 

We proposed that the hydrolysis of CS; to CO, by way of COS is 
similar to the hydration of CO, to HCO; by carbonic anhydrases: 
CO,+H,0 HCO; +H”. Indeed, conversion of CS, by carbonic 
anhydrase has been explored previously by quantum chemical 
methods*”°. In addition, COS is a known substrate homologue for 
plant and algal carbonic anhydrases''’”, and plants may constitute 
the most important sink of atmospheric COS". Furthermore, the 


carbonic anhydrase of the flour beetle Tribolium castaneum can 
hydrolyse COS to H2S"*. However, CS, has not been found to be an 
effective substrate for carbonic anhydrase, suggesting the presence of a 
different enzyme for CS, conversion. 

The CS;-converting enzyme (subunit mass 24kDa) was purified 
from Acidianus Al-3 cells and shown to be a CS, hydrolase. The 
Michaelis constant (K,,) for CS; is 130M, which is 20-fold 
lower than the K,, for CO, of B-carbonic anhydrase (Cab) from 
Methanothermobacter thermoautotrophicus (2.9 mM)" and 100-fold 
lower than the K,, for CO, of y-carbonic anhydrase (Cam) from 
Methanosarcina thermophila (15mM)'°. The catalytic rate (k.a,) of 
the CS, hydrolase (952 s ') is also lower than the k.q¢ values of Cab 
(1.7 X 10*s *) and Cam (3 X 10‘s~'), resulting in a similar overall 
catalytic efficiency (kat/Km) of 2-7.3 X 10°s_'M" for both the CS, 
hydrolase and the carbonic anhydrases (Supplementary Table 2). The 
CS, hydrolase converted the intermediate COS with a higher affinity 
(Ky = 22 UM) and k, 4, (1,800 s+) than for CS, (Supplementary Fig. 1). 
The catalytic efficiency of the CS; hydrolase is almost an order of 
magnitude higher for COS than for CS;. The CS; hydrolase did not 
show carbonic anhydrase activity in stopped-flow spectrometry’” 
experiments. Inductively coupled plasma mass spectrometry (ICP- 
MS) analysis showed zinc to be associated with the CS, hydrolase, 
as in most aerobic carbonic anhydrases that have been studied so 
far. Exceptions are cadmium-containing carbonic anhydrases from 
diatoms’* and iron-containing anaerobic carbonic anhydrases from 
methanogens'*"’. The gene encoding the CS, hydrolase was identified 
by peptide sequencing and PCR amplification. Sequence analysis 
showed high homology to B-carbonic anhydrases (Fig. 1 and Sup- 
plementary Fig. 2). 

The structures of two crystal forms of the CS, hydrolase 
(selenomethionine-labelled heterologously expressed protein and 
native purified protein) were determined to resolutions of 2.6 A and 
2.4 A, respectively (Supplementary Table 3). Crystalline CS, hydrolase 
is active, as confirmed by the generation of H,S from CS, in the crystal 
(Supplementary Fig. 3). The Acidianus CS, hydrolase displays a B- 
carbonic anhydrase fold: two monomers form a closely intertwined 
dimer with a central B-sheet capped by an -helical domain (Fig. 2a). 
In both crystal forms, the CS, hydrolase dimers form square octamers 
with a central hole, through interactions of long arms at the N and C 
termini (Fig. 2a, b). Similar octameric rings have been reported for the 
B-carbonic anhydrase of Pisum sativum. However, in the CS, hydro- 
lase, two octameric rings form a hexadecamer by interlocking at right 
angles to each other, forming a highly unusual catenane structure 
(Fig. 2c). To our knowledge, the only other examples of natural protein 
catenanes are the bovine mitochondrial peroxiredoxin IT] (ref. 21) and 
the gp5 capsid protein of bacteriophage HK97 (ref. 22). Small-angle 
X-ray scattering (SAXS) (Fig. 2d) and analytical ultracentrifugation 
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Figure 1 | Neighbour-joining phylogenetic tree showing the evolutionary 
relationship between carbonic anhydrases (CAs) and the newly discovered 
CS, hydrolases. The bootstrap consensus tree inferred from 500 replicates is 
taken to represent the evolutionary history of 96 sequences that were retrieved 
from GenBank and through BLAST, aligned using MUSCLE (http:// 
www.ebi.ac.uk/Tools/msa/muscle/) and then analysed in MEGA 4.0. Branches 
corresponding to partitions that were reproduced in less than 50% of bootstrap 
replicates are collapsed. The bootstrap values are indicated at the branch points. 
The tree is drawn to scale, with branch lengths in the same units as those of the 
evolutionary distances that were used to infer the phylogenetic tree (scale bar, 


(Supplementary Fig. 4) confirmed the presence of this unusual 
catenane structure in solution. In addition, both forms of the enzyme 
were detected and shown to actively convert CS, to H2S on native 
polyacrylamide gel electrophoresis (PAGE) gels (Supplementary 
Fig. 5). Considering the large amount of CS, hydrolase present in a 
crude extract of Acidianus A1-3 cells growing on CS, (Supplementary 
Fig. 5), as well as the gaseous nature of its intermediate product, COS, 
tight packing of the enzyme would enable more enzyme to be present 
in the cell and result in a higher density of active sites for COS to react 
with before it diffuses away. Moreover, the arms at the N and C termini 
that form the octamer lie along the sides of the square octameric ring, 
so interlocking two of these rings may help to keep these arms in place, 
stabilizing the octameric structure. 

The active site of native CS, hydrolase contains a zinc ion coordinated 
by Cys 35, His 88, Cys 91 and a solvent molecule (Fig. 3a). Other solvent 
molecules were not observed in the active site. Structural comparisons 
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0.2 amino acid substitutions per site). All positions that contain alignment gaps 
and missing data were eliminated only in pairwise sequence comparisons. The 
final data set contained 1,382 positions. The eight y-carbonic anhydrase and six 
a-carbonic anhydrase sequences were used as outgroups (grey). B-Carbonic 
anhydrase sequences are clustered in clades (dicotyledons, monocotyledons 
and clades A-D). CS, hydrolase sequences are clustered in clade D. Gene or 
protein names are indicated in parentheses where relevant. Confirmed CS, 
hydrolases are shown in bold. Accession numbers are listed in the 
Supplementary Information. 


of the active sites of the CS; hydrolase and the most closely related 
B-carbonic anhydrases**”*, as identified by the Dali program’, showed 
zinc-coordinating residues at identical positions (Fig. 3b). This finding 
suggests that the first step in the catalytic mechanism of the CS, hydro- 
lase, involving a hydroxide ion, is analogous to that of B-carbonic 
anhydrase** (Supplementary Fig. 6). Quantum chemical studies of 
carbonic anhydrase active site models”’® suggest that the zinc ion is 
subsequently coordinated by both the oxygen from the former hydroxide 
and a sulphur atom from the substrate. The cleavage of the very stable 
Zn-S bond of the HS” — Zn complex could be assisted by protonation of 
the HS” moiety’. The hydrolysis of COS may proceed analogously 
(Supplementary Fig. 6). 

The similarity between the CS, hydrolase and carbonic anhydrase 
active sites raises the question of why the CS, hydrolase has a markedly 
different reaction specificity from carbonic anhydrase, exclusively 
hydrolysing CS, and COS rather than CO). The answer may lie in 
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Figure 2 | X-ray crystal structure of the Acidianus A1-3 CS, hydrolase. 

a, Dimer structure. One monomer is shown in red, and the other in blue. The 
long arm-like extensions at the N and C termini of each monomer are indicated. 
The active site zinc ions are indicated as grey spheres. b, Four dimers form a 
square octameric ring through extensive interactions of the N- and C-terminal 
arms. Monomers are shown in contrasting colours. c, The hexadecameric 
complex is formed by two interlocked octameric rings. One ring is shown with 
the monomers in various shades of blue, and the other with the monomers in 
various shades of green. Access routes to the active site tunnels are shown in red. 
d, SAXS results from a solution of CS, hydrolase (data points) and fits of the 
data (curves) assuming 100% hexadecamer (blue), 100% octamer (green) or a 
mixture of 83% hexadecamer and 17% octamer (red). S corresponds to the 
momentum transfer in A~'. 


the access to the active sites of these enzymes. Superimposing struc- 
turally related carbonic anhydrases shows that the same funnel sup- 
plies access to the active site (Supplementary Fig. 7). However, in the 
CS, hydrolase, this access route is blocked by the protomer N- and 
C-terminal arms within the octamer (Supplementary Fig. 7), and the 
only remaining access to the active site is by way of a very narrow and 
highly hydrophobic tunnel of 15 A in length (Fig. 3c), which is lined by 
residues from three monomers. Thus, the formation of the octamer 
substantially reduces access to the active site for hydrophilic substrates. 
Indeed, CS, is over an order of magnitude more hydrophobic than 
COz, as judged by octanol:water partitioning coefficients (using the 
LOGKOW database, http://logkow.cisti.nrc.ca/logkow/). Although we 
cannot exclude second-sphere effects, we propose that the narrow 
hydrophobic tunnel is an important factor in determining the sub- 
strate specificity of the CS, hydrolase. 

To test whether active site accessibility determines the reaction spe- 
cificity, we designed seven CS, hydrolase mutants, with the amino acid 
residues that form the tunnel wall being replaced by smaller or larger 
residues. Furthermore, the C terminus was truncated (mutant 
Gly199Stop) (Fig. 4a). The activities of the mutant CS, hydrolases were 
quantified (Fig. 4b) and were visualized on native PAGE activity gels 
(Supplementary Fig. 5). The mutants with a change in Phe 78 lost most 
or all of their activity (Fig. 4b). Phe 78 is close to the active site (Fig. 4a), 
and any change in this region of the protein adversely affected enzyme 
activity. By contrast, widening the entrance of the tunnel by shortening 
the C terminus or by replacing large hydrophobic residues with smaller 
ones (for example, Phe77Ala or Phe96Ser) (Fig. 4a) reproducibly 
increased the catalytic efficiency of the mutants by up to twofold 
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Figure 3 | The CS, hydrolase active site and the entrance to the active site. 
a, The active site of the CS, hydrolase. The catalytic zinc ion (blue sphere) is 
coordinated by two cysteine residues, a histidine residue and a solvent molecule 
(w, red sphere). The interactions between the zinc atom and its ligands are shown 
as dashed red lines. The final, refined 2mF, — DF- electron density (blue, 10) is 
shown. The F, — DF- electron density (green, 20), which was calculated before 
the incorporation of the solvent molecule into the model, is also shown, as well as 
the anomalous difference density (orange, 50), which confirms the position of 
the zinc ion. b, Superposition of active site residues of the CS, hydrolase (pale 
blue, carbon atoms) and structurally related B-carbonic anhydrases (white, 
carbon atoms): Cab (clade D) from M. thermoautotrophicus”, Rv3588c (clade 
C) and Rv1284 (clade D) from Mycobacterium tuberculosis**, Nce103 (clade A) 
from Saccharomyces cerevisiae’? and CAN2 (clade A) from Cryptococcus 
neoformans’®. For Mycobacterium Rv3588c, which is a type II B-carbonic 
anhydrase, the equivalent of Asp 37 coordinates the zinc ion, indicating the 
position of the substrate. By contrast, in the type I B-carbonic anhydrases, this 
residue forms a salt bridge with an arginine residue, as is the case for the CS, 
hydrolase (Arg 39). The hydrolytic solvent molecule in the CS hydrolase (w) is 
indicated as a red sphere. c, Hydrophobic tunnel (green) providing access to the 
CS, hydrolase active site. The residues that constitute the tunnel wall are 
indicated, and each protein monomer is individually coloured (red, blue and 
grey). a—c, Nitrogen atoms are shown in dark blue, oxygen atoms in red and 
sulphur atoms in yellow. 


(Fig. 4b). This finding suggests that the CS, conversion rate depends 
on the properties of the tunnel in modulating the accessibility of the 
active site to the reactants (CS, COS or H2O) or to product release. 
Although one might expect the reactivity towards CO, to increase when 
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the tunnel entrance is widened, this was not the case; none of the mutants 
showed carbonic anhydrase activity when tested using stopped-flow 
spectrometry. It is possible that it is not CO, that poses accessibility 
problems but its hydrolysis product, HCO; . Furthermore, CO, is 
capable of hydrogen bonding to the surrounding water molecules, 
whereas CS, is not; therefore, the narrow tunnel might discriminate 
between CO2-H,0 complexes and CS, on the basis of size. This would 
also explain how the reaction products CO, and HO could exit the 
active site through the tunnel. 

To investigate possible routes by which the CS, hydrolase evolved, 
we searched for homologous protein sequences and found a 204 amino 
acid protein (AAK41461) encoded in the genome of Sulfolobus 
solfataricus P2 (ref. 29) (Fig. 1 and Supplementary Fig. 2), one of 
two clade D f-carbonic anhydrase homologues in this organism. 
The encoding gene (SSO1214) is not present in the genomes of other 
members of the Sulfolobales that have been sequenced, except in one of 
the seven sequenced Sulfolobus islandicus genomes’ (strain L.S.2.15) 
(Fig. 1 and Supplementary Fig. 2). Indeed, we detected CS, hydrolase 
activity in S. solfataricus P2 cells but not in cells of its close relative 
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Figure 4 | Effect of mutations in the access tunnel on activity. a, The 
rationale underlying the design of selected mutants. The tunnel is indicated in 
green. Native residues that were mutated are shown as large grey spheres; some 
of the mutations are modelled in orange. The large blue sphere is the nitrogen of 
the Trp residue. Nitrogen atoms are shown in dark blue, oxygen atoms in red 
and sulphur atoms in yellow. The interactions between the zinc atom (small 
blue sphere) and its ligands are shown as dashed red lines. Phe 77 and Phe 96 
were replaced by smaller residues; Phe 78 was replaced by Trp; and Ile 201 is 
absent in the Gly199Stop mutant. A stereo version of this panel is provided as 
Supplementary Fig. 9. b, Michaelis-Menten kinetics of the CS, hydrolase 
mutants. The data are presented as the mean + s.e.m., as calculated from three 
independent experiments. The Km, Vinax and keat/Km (in s-' M7!) values were 
determined by nonlinear regression analysis. Phe778Ser and Phe778Ala 
correspond to the Phe77Ser plus Phe78Ser and Phe77Ala plus Phe78Ala double 
mutants, respectively. ND, not determined. 
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Sulfolobus shibatae or in three Acidianus species (Acidianus brierleyi, 
Acidianus ambivalens and Acidianus infernus). The heterologously 
expressed S. solfataricus SSO1214 gene product was shown to have 
CS, hydrolase activity but no carbonic anhydrase activity (Supplemen- 
tary Fig. 8), suggesting that the S. solfataricus AAK41461 protein has 
been erroneously annotated as a carbonic anhydrase. As is the case for 
the Acidianus CS, hydrolase, AAK41461 is a carbonic anhydrase 
homologue that has evolved divergently to have CS, hydrolase activity. 
Most likely, the true carbonic anhydrase of S. solfataricus P2 is the 
second carbonic anhydrase gene to be identified, SSO2735, which 
encodes the protein AAK42846. This protein clusters with other 
archaeal f-carbonic-anhydrase-like enzymes, whereas the CS, 
hydrolases cluster mainly with putative bacterial B-carbonic anhy- 
drases (Fig. 1). Inspection of the genomic environment of the CS, 
hydrolase gene SSO1214 revealed the presence of transposon- 
associated genes both upstream (SSO1212) and downstream 
(SSO1216 and SSO1217) of SSO1214. These transposon-associated 
genes could also be identified surrounding hypothetical CS, hydrolase 
genes from S. islandicus strain L.S.2.15, Nitrosomonas sp. AL212, 
Mycobacterium marinum and Haloquadratum walsbyi. Further- 
more, a Phe-77 Phe-78 motif is present in confirmed CS, hydrolases 
and in the above strains but not in confirmed carbonic anhydrases, 
strongly suggesting that this motif is a characteristic feature of CS, 
hydrolases (Fig. 1 and Supplementary Fig. 2). In addition, as men- 
tioned above, Phe-78-modified mutants all lose their CS, hydrolase 
activity. Taken together, these findings point to lateral gene transfer as 
a likely mechanism for the acquisition of the CS, hydrolase gene in 
these organisms, rather than duplication and divergence of their car- 
bonic anhydrase gene. It may also explain why the ability to convert 
CS, is not shared by all Sulfolobales. 

Our results show a new example of divergent evolution: CS, and CO, 
are non-interchangeable substrates of CS; hydrolases and carbonic 
anhydrases, although the catalytic sites of these enzymes are identical. 
The difference between the two enzymes lies in the unusual oligomeric 
structure of CS, hydrolase, in which a specificity filter is built into the 
active site entrance. Thus, a new enzyme emerged owing to the evolu- 
tion of a new quaternary structure rather than through active site muta- 
tions. The omnipresence and convergent evolution of the carbonic 
anhydrases, as well as the absence of the CS, hydrolase in most of the 
members of the Sulfolobales that have been sequenced, suggests that 
carbonic anhydrases are the older of the two enzymes. Our data con- 
tribute to the ongoing discussion on the evolution and (re)design of 
enzymes, which has been reviewed recently”. 


METHODS SUMMARY 


Acidianus strain A1-3 was cultivated at 70 °C in a chemostat on modified Allen’s 
mineral medium with CS, as the main carbon and energy source. Cells from the 
chemostat effluent were collected by centrifugation and lysed using a French press. 
CS, conversion was examined by gas chromatographic analysis or with an H2S 
sensor. Carbonic anhydrase activity was measured by stopped-flow spectrometry 
at 25 °C. CS, hydrolase activity on native PAGE gels was determined using Pb(11) 
acetate. The Acidianus CS, hydrolase was purified from cell-free extracts by cation 
and anion exchange chromatography. The molecular mass of the purified CS, 
hydrolase was determined by matrix-assisted laser desorption/ionization-time of 
flight mass spectrometry (MALDI-TOF MS), and ICP-MS was used to determine 
the presence of metal cofactors. The purified CS, hydrolase was digested with 
trypsin, and the dominant peptides were sequenced using tandem mass spectro- 
metry. The amino acid sequences were used to design degenerate PCR primers, 
and the CS, hydrolase gene was amplified with Acidianus genomic DNA as a 
template. S. solfataricus SSO1214 was cloned after PCR amplification from geno- 
mic DNA. Native and mutant proteins were obtained by heterologous expression 
of synthetic gene constructs and were purified using heat-shock precipitation and 
gel filtration where applicable. Phylogenetic analyses were carried out using the 
MEGA 4.0 package. Crystal structures were determined from native and seleno- 
methionine-labelled protein crystals, by using single-wavelength anomalous dis- 
persion methods. SAXS and analytical ultracentrifugation were used to ascertain 
the oligomeric state of the protein. 
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Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Strains, media and culture conditions. Acidianus strain A1-3 was isolated from 
the fumarolic, ancient sauna building at the Solfatara volcano (Naples, Italy). 
Enrichment and pure cultures of this strain were grown in a chemostat at 70 °C 
on modified Allen’s mineral medium* supplemented with 0.1 g1~' yeast extract 
and Vishniac trace elements” with an additional 4.5 mg]™ ' Na B,O,"10H5O and 
0.03 mg] ' VOSO,"5H,0. The medium was acidified with 0.125% (v/v) sulphuric 
acid, resulting in a pH of approximately 1. The main carbon and energy source was 
supplied as 80 uM CS, gas mixed with air, bubbled through the reactor with a gas 
flow of 20mlmin™'. The growth yield owing to the consumption of CS. was 
calculated to be 80% of the total growth yield under these conditions. A pure 
culture of Acidianus A1-3 was obtained by plating an enrichment culture onto 
Gelrite plates containing mineral medium with 2% elemental sulphur (S°) as the 
electron donor, 0.1 g I=? yeast extract and 0.1% (v/v) H2SOx. Plates were incubated 
at 70°C with 5% (v/v) COz gas as the carbon source. Single colonies were sub- 
cultured twice until purity. After a pure culture was obtained with growth on CO, 
and S°, the capacity of the microorganism to utilize CS, was reconfirmed. 

A. ambivalens 3772, A. brierleyi 6334 and A. infernus 3191 were obtained from 
DSMZ and grown at 80°C, 70°C and 88°C, respectively. They were grown in 
medium 358 (http://www.dsmz.de/microorganisms/medium/pdf/DSMZ_Medium358. 
pdf), pH 2.5, supplemented with 1% (w/v) S° (A. ambivalens and A. infernus) or 
5% (w/v) S° (A. brierleyi), 5% CO and 0.02% (w/v) yeast extract (A. ambivalens) or 
1% (w/v) yeast extract (A. brierleyi and A. infernus). S. solfataricus P2 and S. 
shibatae (courtesy of J. van der Oost) were grown at 80°C in modified Allen’s 
mineral medium with either sterile S° plus 5% CO, or 16.7 mM glucose. Strains 
were grown in serum bottles closed with grey butyl rubber stoppers. When cultures 
had grown for approximately 1 week, CS, was added to a head-space concentration 
of 201M, and the conversion of CS, to COS and H2S was examined using a 
Chrompack CP9001 gas chromatograph, as described previously”’. 

Cell-free extracts. Chemostat effluent was collected and kept at 4°C for a 
maximum of 4 days before cells were collected by centrifugation (10,000g, 
10 min, 4°C). The pellet was washed in 10mM phosphate buffer, pH7, until 
the pH was between 6 and 7. The cells were lysed by at least four passes through 
a French press (American Instrument Corporation) at 1,260 psi. Cell-free extract 
was obtained by centrifuging the suspension at 48,000g for 30 min at 4°C, to 
remove unlysed cells and cell debris. Glycerol was added to 5% (v/v), and extracts 
were stored at —20°C. 

Mechanism of CS, conversion. The conversion of CS, into HS and COS was 
followed in aerobic and anaerobic 120 ml stopper bottles containing 5 ml 20 mM 
phosphate buffer, pH 7. For the anaerobic experiments, the bottles were flushed 
with N> for 30 min. In addition, the syringes were flushed with N> before use. The 
CS, gas stock bottles were prepared by adding 20 ll liquid CS to 120 ml bottles, 
with the liquid evaporating inside the bottles to form CS, gas stock bottles. CS) gas 
from the aerobic and anaerobic stock bottles was injected into the buffer-containing 
aerobic and anaerobic 120 ml bottles, to achieve a head-space concentration of 
8.5 uM. The bottles were incubated at 60 °C with shaking at 100 r.p.m. Pre-warmed, 
diluted Acidianus cell-free extract (13 Lg protein in 100 pl) was injected into the 
bottles, and the conversion of CS, into its products, COS and H,S, was followed by 
gas chromatography. 

Enzyme purification. The Acidianus CS hydrolase was purified from cell-free 
extracts by cation exchange chromatography, followed by anion exchange chro- 
matography. The chemicals used for chromatography were of ultrapure grade. The 
Acidianus cell-free extract (150 mg protein) was loaded onto a 14 X 2cm column 
packed with Carboxymethyl Sepharose Fast Flow beads (GE Healthcare), which 
was equilibrated with 50 mM acetate buffer, pH 5. The proteins were separated at 
room temperature, using fast protein liquid chromatography (Hitachi) with a 
linear gradient from 0 to 1M NaCl over 10 column volumes and a flow rate of 
2ml min‘. The fractions were collected and assayed for CS, hydrolase activity 
(see below). The active fractions were pooled and concentrated twofold, and the 
buffer was exchanged to 20 mM Bis-Tris propane-HCl, pH 6.5, by filtration in an 
ultrafiltration cell with a 43-mm diameter PM30 membrane (Amicon) at room 
temperature and with stirring at 100 r.p.m. The concentrated and washed fractions 
were filtered through a 0.45-1m syringe filter unit (Millex-HV, Durapore PVDF) 
and loaded onto an anionic DEAE Sepharose Fast Flow 5PW column (75 X 7.5 
mm, TSK) equilibrated with 20 mM Bis-Tris propane-HCl, pH 6.5, using high- 
performance liquid chromatography (with Agilent 1100 series equipment). The 
proteins were separated at room temperature, using a gradient from 0 to 0.5M 
NaCl over 25 min (4.5 column volumes) with a flow rate of 0.6ml min '. The 
fractions containing the purified enzyme were stored on ice or, if not used within 1 
week, were frozen at —20 °C after the addition of glycerol to 5% (v/v). 
MALDI-TOF MS. The molecular mass of the purified CS, hydrolase was deter- 
mined by matrix-assisted laser desorption/ionization time-of-flight mass spectro- 
metry (MALDI-TOF MS). The purified protein fraction (4.7 pmol in 0.5 pl) was 
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mixed with an equal volume of matrix solution, consisting of 10g] ”' 
3,5-dimethoxy-4-hydroxycinnamic acid in 0.05% (v/v) trifluoroacetic acid and 
50% (v/v) acetonitrile. The resultant mixture was spotted onto a stainless steel 
target plate, dried and analysed in linear mode on a Biflex II] MALDI-TOF mass 
spectrometer (Bruker), and then calibrated with a ProteoMass Protein MALDI- 
MS Calibration Kit (Sigma). 

Determination of metal-associated cofactors by ICP-MS. ICP-MS was used to 
determine the presence of metal cofactors associated with 17.6 ug purified CS, 
hydrolase from the anion exchange purification step (in 20 mM Bis-Tris propane- 
HCl, pH 6.5, and 230 mM NaCl). The negative controls consisted of protein frac- 
tions without CS, hydrolase activity, buffer and deionized water. Carbonic anhy- 
drase from bovine erythrocytes (201g; SERVA) served as a positive control. 
Samples were prepared and analysed on an ELEMENT 2 ICP MS (Thermo 
Scientific) according to the manufacturer’s protocol. Height point calibration 
curves were created by dilution of a standard multielement solution of zinc and 
copper (at a final concentration of 0, 10, 20 and 50 p.p.b.; Merck) in 0.6% nitric acid 
solution. Quantification was performed using PlasmaLab software (Thermo). 
Protein gel electrophoresis. SDS-PAGE (12% polyacrylamide, pH 8.3) and 
native PAGE (8% polyacrylamide, pH8.3) were performed using a Mini- 
PROTEAN 3 Cell (Bio-Rad) at room temperature. Fermentas SM0431 molecular 
weight standards were used for SDS-PAGE. For native PAGE, various mixtures of 
bovine erythrocyte carbonic anhydrase (SERVA), blue dextran, apoferritin, 
B-amylase, B-lactoglobulin, thyroglobulin and albumin (Sigma) were used as 
indicators of molecular weight. Proteins were visualized with Coomassie 
Brilliant Blue G-250. The CS, hydrolase activity on native PAGE gels was deter- 
mined by incubating the gels in an airtight jar at 60°C in 20mM Tris, pH6.9, 
containing 1 mM Pb(11) acetate, in the presence of about 2 mM CS, gas (achieved 
by injecting 15 ml CS,-saturated air from an airtight 11] bottle containing 50 ml 
CS). The gels were incubated until brown bands, consisting of lead sulphide 
precipitate, appeared. 

CS, hydrolase activity measurements. The CS, hydrolase activity in the puri- 
fication fractions was monitored qualitatively by adding samples to microtitre 
wells containing 100 pl 10 mM phosphate buffer or 20 mM HEPES buffer, pH 7, 
and 10 mM Pb(1) acetate. The Pb(11) acetate precipitated over time, but this did not 
negatively influence the activity tests. The microtitre plate was incubated for 
5-30 min at 50-70 °C in an airtight jar in the presence of CS, as described above. 
In the wells containing the active fractions, the H2S produced by the CS, hydrolase 
formed a clearly visible brown lead sulphide precipitate. The CS, hydrolase activity 
was quantified by gas chromatography, as well as with an H,S microsensor. For gas 
chromatography, appropriate amounts of the enzyme were diluted in 10mM 
phosphate buffer, pH 7, in 120 ml bottles sealed with grey butyl rubber stoppers, 
and 5 ml CS,-saturated air was injected to a final concentration of about 700 hM 
CS. The bottles were incubated at 50°C with vigorous shaking to ensure good 
transfer of gas into the buffer. The formation of CS, hydrolysis products (COS and 
H,S) was measured by gas chromatography for about 45 min, and the initial 
production rates were used to calculate enzyme activity. Alternatively, an H,S 
Clark-type microsensor (Unisense) was used. A 6.6 mM CS, stock was prepared 
by mixing 200 ul with 500 ml distilled H,O in a 500 ml serum bottle by vigorous 
shaking for at least 30 min. Mutant and native enzyme extracts were diluted in 
20 mM HEPES, pH 7. Depending on the activity of the enzyme, 0.5-6 jug protein 
was added to a 1 ml vessel containing 1 ml 20 mM HEPES, pH 7, and was stirred 
at 500 r.p.m. in a 50°C water bath. CS, was added from the stock bottle to 
concentrations of 50-660 uM CS;, and the formation of H,S was followed for 
approximately 30s. The initial H,S production rates were calculated, and the 
Michaelis-Menten parameters (K,, and V,,ax) were determined by nonlinear 
regression using Prism 5 (GraphPad). COS activity measurements were per- 
formed by gas chromatography as described above, using 600 il enzyme solution 
and 20 mM HEPES buffer, pH 7. 

Peptide sequencing. The purified CS, hydrolase was reduced and alkylated before 
digestion using dithiothreitol and iodoacetamide, followed by trypsin digestion, 
desalting and concentration using stop-and-go elution tips**. Mass spectrometric 
de novo sequencing of peptides was performed on a 7 Tesla hybrid linear ion-trap 
Fourier transform ion-cyclotron-resonance mass spectrometer (LTQ-FT; Thermo 
Scientific), coupled with an 1100 series nanoflow liquid chromatograph (Agilent). 
Chromatographic separations were performed using a 15-cm fused silica emitter 
(PicoTip emitter; tip, 8 + 1 jim; internal diameter, 100 ttm; FS360-100-8-N-5- 
C15; New Objective) packed in-house with reverse phase ReproSil-Pur C18AQ 
3 um resin (Dr. Maisch). The peptides were loaded directly onto the analytical 
column over 20 min ata flow rate of 600 nl min’! in 3% (v/v) acetonitrile and 0.5% 
(v/v) acetic acid. The peptides were eluted from the column using a 60-min linear 
gradient of 10% acetonitrile and 0.5% acetic acid to 40% acetonitrile and 0.5% 


acetic acid at a flow rate of 300 nl min ?. 
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The mass spectrometer was operated in positive ion mode and was pro- 
grammed to analyse the two most abundant ions from each precursor scan by 
collision-induced dissociation experiments. Precursor scans by the ion-cyclotron- 
resonance cell were set to cover a range of 350-2,000 m/z at a resolution power of 
5X 10* FWHM at m/z = 400. Each peptide ion selected for collision-induced 
dissociation analysis was fragmented twice in the linear ion trap (30% normalized 
collision energy, 30 ms activation time, activation q = 0.250, AGC target 3 x 10* 
ions, and 4 Th isolation width), and fragment ions were analysed in the ion trap 
and in the ion-cyclotron-resonance cell. 

The fragmentation spectra from the precursor ions of m/z [M+ 2H]?* = 

518.2834, [M+ 2H]** = 669.8513, [M + 2H]** = 733.8698 and [M+ 2H]** = 
947.5529 were further analysed by automated de novo analysis using PEAKS 
Studio 4.2 (Bioinformatics Solutions). The settings used for de novo sequence 
interpretation included trypsin specificity with a maximum of one missed cleavage 
and carboxamidomethyl on cysteine residues as a fixed modification, and they 
allowed the oxidation of methionine as a variable modification. Mass tolerances 
were set to 20 p.p.m. for precursor ions and 0.6 Da for fragment ions measured in 
the linear ion trap or to 20 p.p.m. for fragment ions analysed in the ion-cyclotron- 
resonance cell. Automated de novo sequencing that was validated and corrected by 
manual interpretation resulted in the following partial sequences from the respective 
[M+ 2H]’* ions: LEDYALR (m/z = 518.2834), VSEYLDSE (m/z = 669.8513), 
TGYVYEVETHR (m/z = 733.8698) and KPTEVQLDP (m/z = 947.5529). The 
peptide sequences were compared with databases at the National Center for 
Biotechnology Information using BLASTP. The closest homologous protein was 
the annotated carbonic anhydrase AAK41461 from S. solfataricus P2. 
Cloning the Acidianus A1-3 CS, hydrolase gene. Genomic Acidianus DNA was 
isolated**, and an internal fragment of the Acidianus CS, hydrolase gene was 
amplified by degenerate PCR, using the amino acid sequence obtained by the de 
novo sequencing of the four most abundant peptides, in combination with the 
aligned homologous gene from S. solfataricus P2 and the A. brierleyi codon usage, 
to design degenerate PCR primers (Supplementary Table 1). PCR was performed 
using the green GoTaq Flexi kit (Promega) with 200 pmol of each of the degenerate 
primers 1bF and 525bR, using a hot start reaction in a PCR machine (Biometra) 
with a gradient of annealing temperatures between 39 °C and 46 °C. Two main 
products, 500 base pairs (bp) and 300 bp, were obtained, and these were excised 
from the gel and purified using the QIAEX II Gel Extraction Kit (Qiagen). The 
cleaned 1bF-525bR PCR products were used as template for nested PCR with the 
primers 40bF and 333bR (Supplementary Table 1) at an annealing temperature of 
44 °C. From both the 300-bp and the 500-bp fragments, nested PCR products of 
the expected 310 bp were obtained. The purified PCR products were cloned into 
pGEM-T Easy (Promega), and ligated plasmids were transformed into Escherichia 
coli Top 10 (Invitrogen) by heat shock. Purified plasmid DNA was sequenced with 
the m13F primer at the sequencing facility of the Anthropogenetics Department of 
the Radboud University Nijmegen Medical Centre. The sequences were analysed 
using Vector NTI software. 

The C- and N-termini of the Acidianus A1-3 CS, hydrolase gene were cloned 
by inverse PCR: 10 jig genomic Acidianus DNA was digested in three different 
reactions with either EcoRI, Rsal or Sau3AI. The EcoRI digestion was incubated 
for 2h before a 15-min inactivation at 80°C. From the Rsal and Sau3AlI digests, 
10 il samples were taken during incubation and were inactivated at 80 °C every 
5 min up to 25 min to prevent overdigestion, and the five inactivated samples were 
pooled. The digested genomic DNA fragments were ligated to form circular pieces 
of DNA using T4 DNA ligase (Fermentas). To prevent the ligation of multiple 
fragments at high DNA concentrations, three ligations were performed per digest, 
with decreasing DNA concentrations (2, 1 and 0.4ng pl '). Ligations were per- 
formed at 4 °C for 72 h, and the three ligations per digest were pooled. Inverse PCR 
was performed using the primers F4inv and R3inv (Supplementary Table 1) with 
Phusion High-Fidelity DNA polymerase (Finnzymes), in the supplied buffer HF 
according to the manufacturer’s instructions. With each of the three different 
ligated digests, a dominant PCR product was obtained with the sizes as follows: 
2.5 kilobases (kb) (EcoRI), 5kb (RsaI) and 1.3 kb (Sau3AI). The products were 
cloned into pJET1/blunt (Fermentas) and sequenced using the pJET sequencing 
primers provided with the GeneJET PCR Cloning Kit (Fermentas). All three PCR 
products yielded sequence data corresponding to the Acidianus CS, hydrolase 
gene (the S. solfataricus SSO1214 homologue) and the promoter region upstream, 
as well as the gene downstream of the CS, hydrolase. The amino acid sequence 
derived from the complete 615-bp CS, hydrolase gene was found to be 75% 
identical to the S. solfataricus SSO1214 gene product AAK41461. It has a predicted 
isoelectric point of 5.92 and a molecular mass of 23,576 Da. 

Cloning S. solfataricus P2 $SO1214. DNA encoding the annotated carbonic 
anhydrase of S. solfataricus P2 was cloned into pET-30a(+) (Novagen) after 
PCR amplification using the primers S.solf P2 CS, hydrolase Fl, F2, Rl and R2 
(Supplementary Table 1) in all four combinations as described above, using 120 ng 


genomic S. solfataricus P2 DNA and an annealing temperature of 59.3 °C. PCR 
products were cloned into pGEM-T Easy (Promega) according to the manufac- 
turer’s instructions and were checked by sequencing with the primer m13F 
(Supplementary Table 1); they were then cloned into pET-30a(+) and rechecked 
by sequencing with the primer T7rev (Supplementary Table 1). 

Phylogenetic analysis. In total, 96 carbonic anhydrase or carbonic-anhydrase-like 
protein sequences were retrieved from GenBank and via BLAST. These sequences 
were aligned using MUSCLE (http://www.ebi.ac.uk/Tools/msa/muscle/), and 
phylogenetic analysis was performed in MEGA 4.0 (ref. 36), using the neighbour- 
joining method”, and displayed in a bootstrap consensus tree inferred from 500 
replicates**. Branches corresponding to partitions that were reproduced in less than 
50% of bootstrap replicates were collapsed. The evolutionary distances were com- 
puted using the Poisson correction method” and are shown in units of the number 
of amino acid substitutions per site. Positions containing alignment gaps and miss- 
ing data were eliminated only in pairwise sequence comparisons (pairwise deletion 
option). There are a total of 1,382 positions in the final data set. 

Stopped-flow spectrometry. Carbonic anhydrase activity was measured by 
stopped-flow spectrometry at 25 °C, using the changing pH indicator method", 
on an RX 2000 stopped-flow system (Applied Photophysics) coupled to an HP- 
8453 spectrophotometer. Enzyme solution containing the pH indicator phenol red 
(0.2 mM phenol red (As57 nm3 Merck) in 50 mM HEPES and 50 mM NaSO,j (ionic 
strength 0.15), pH 7.5) was mixed 1:1 with CO,-saturated water (32.9 mM) in the 
reaction chamber to a total volume of 200 pul. Measurements were taken every 1 s. 
Expression construct. A synthetic gene encoding the Acidianus CS, hydrolase— 
which was codon optimized for E. coli and had a G+C content of 47%, a stop 
codon, and a 5’ Ndel restriction site and a 3’ Notl restriction site—was obtained 
from GenScript and cloned into pET24b using the Ndel and NotI restriction sites. 
The resultant plasmid, pET24bCSjhydsynth, was used to transform E. coli 
BL21(DE-3) cells. 

Site-directed mutagenesis. Single and double base substitutions and a deletion 
mutant of the synthetic Acidianus CS, hydrolase gene in pET24bCS,hydsynth 
were made using a QuikChange Site-Directed Mutagenesis Kit (Stratagene). Eight 
primers were designed with the required mutation in the centre of the primer 
(Supplementary Table 1). The whole plasmid was amplified using the overlapping 
primer sets (125 ng each), 15ng methylated template DNA, 5mM dNTPs and 
2.5U PfuUltra High-Fidelity DNA Polymerase (Stratagene), according to the 
manufacturer’s instructions, with an annealing temperature of 55°C. Dpnl 
(10U) was added to the PCR products to degrade the methylated template 
DNA at 37°C for 2-16h. Treated PCR products (2 ul) were transformed into 
E. coli Top10 by heat shock, and transformants were selected on LB plates contain- 
ing 30pgml_' kanamycin. Plasmid DNA was isolated from up to six clones 
(FlexiPrep Kit; Amersham Biosciences) and was checked by sequencing using 
the primer T7rev. Plasmids containing the correct mutation(s) were transformed 
into E. coli BL21 for expression studies. 

Protein expression. E. coli BL21(DE-3) cells that had been transformed 
with pET24bCS,hydsynth were grown and induced in 5-61 LB medium for 
native protein production or in minimal medium supplemented with 
L-selenomethionine (SeMet) for SeMet-labelled protein production. Cells were 
collected by centrifugation at 5,500g for 15 min at 4°C and resuspended in PBS. 
A Complete, EDTA-Free Protease Inhibitor Cocktail Tablet (Roche), 1 mM MgCl, 
and a spatula tip of solid DNase I were added, after which the cells were lysed by 
two passes through a fluidizer at 100 psi (Microfluidics). The resultant suspension 
was incubated in a water bath at 70°C for 30 min and then cooled on ice. The 
mixture was then centrifuged at 73,000g for 40 min at 4 °C, and the supernatant 
(80 ml) was collected. This supernatant was dialysed overnight against 21 gel 
filtration buffer (25mM Bis-Tris propane and 200mM NaCl, pH 6.5, adjusted 
with HCl), after which the solution was concentrated to 5 ml by ultrafiltration 
(Amicon, 30 kDa molecular weight cutoff; Millipore) and submitted to gel filtra- 
tion on a 300 ml Superdex 200 HR column. The fractions containing pure recom- 
binant CS, hydrolase were identified using 15% SDS-PAGE, pooled and 
concentrated, as described above, to an Azgo of ~5. 

Alternatively, E. coli BL21 cells containing synthetic native or mutant enzyme 
constructs (see below) were grown and induced in 50 ml cultures, collected as 
described above and lysed for 30 min at 37 °C with 2 mg ml ' lysozyme followed 
by three rounds of bead beating (1 min at 30 Hz; Retsch) with intermittent cooling 
on ice. The lysates were incubated at 70 °C for 10 min to denature E. coli proteins 
and centrifuged at 13,000g for 10 min to remove precipitated proteins. 
Crystallization and crystal handling. Plate- and box-like single crystals were 
obtained using the hanging-drop vapour diffusion method, mixing 2 il protein 
solution with 1 pl reservoir solution containing 25% PEG 1000, 200 mM Li,SO,, 
100 mM LiCl and 50 mM phosphate/citrate buffer, pH 4.2, followed by equilib- 
ration against this reservoir solution at 12 °C for 4 days (monoclinic crystals) or 
20°C (orthorhombic crystals). When SeMet-labelled protein was used, 5mM 
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dithioerythritol was added to the set-ups. Crystals were flash-cooled in liquid 
nitrogen, after cryoprotection by soaking in reservoir solution containing 30% 
PEG 1000. 

Structure determination. A 2.7 A resolution single-wavelength anomalous dis- 
persion data set was collected from an SeMet-labelled crystal at the X10SA beam 
line of the Swiss Light Source at a wavelength of 0.9790 A (Supplementary Table 3). 
All data were processed using XDS*, and care was taken to exclude data not 
conforming to Wilson statistics. Using SHELXD”, a plausible solution with 60 
selenium sites was found only in approximately 1 of 500 trials. The best solution 
was used as input into autoSHARP” for phase determination (phasing power for 
anomalous data 0.838, and FOMacentric 0.33 before solvent flattening), which 
resulted in an excellent electron density map, into which one monomer of the 
protein was built manually using Coot**. This monomer was then copied 16 times 
to complete the model of the asymmetrical unit. The structure was then refined 
using CNS“ and REFMACS (ref. 45) software against the single-wavelength 
anomalous dispersion data set and finally against a 2.4 A resolution data set 
collected at the same beam line at a wavelength of 0.98089 A from a crystal soaked 
in potassium thiocyanate, using a test set defined in thin resolution shells because 
of the high non-crystallographic symmetry. The structure of the orthorhombic 
crystal form was solved by molecular replacement using PHASER“ and refined as 
described above against data collected at X10SA at a wavelength of 0.97178 A from 
a crystal of native protein purified from Acidianus that had been soaked in uranyl 
acetate. For the orthorhombic crystal form, eight-fold non-crystallographic sym- 
metry restraints were applied during refinement. Using the PISA server*’, the 
hexadecameric structure of two interlocking octameric rings was detected both 
in the asymmetrical unit of the C2 crystals from the SeMet-labelled protein and in 
the primitive orthorhombic crystals of the native protein. For the monoclinic 
structure, the number of residues in the allowed, additional allowed and generously 
allowed regions of the Ramachandran plot were 89.0, 11.0 and 0.1%, respectively. 
For the orthorhombic structure, these values were 90.4%, 9.6% and 0.0%, respec- 
tively. No residues were in the disallowed regions of the Ramachandran plot in 
either structure. 

Analytical ultracentrifugation. The oligomeric state of CS, hydrolase in solution 
was investigated by analytical ultracentrifugation at three concentrations 
(Azgo! “™ = 0.25, 0.5 and 0.75, corresponding to 0.25, 0.5 and 0.75 mg ml!) in 
25 mM HEPES/NaOH, 25 mM KCL, pH 7.5, with and without 250 mM NaCl. The 
protein solution was centrifuged at 130,000g in a ProteomeLab XL-I ultracentri- 
fuge (Beckman) at 20°C using 2-sector cells with a 1.2-cm optical path length. 
Absorption data were analysed using SEDFIT™. For each combination of protein 
and salt concentration, two species were detected: a small one, with a sedimenta- 
tion coefficient of 7.8-8.5S, corresponding to 20-23% of the protein; and a large 
one, with a sedimentation coefficient of 13.0-13.9S, corresponding to 74-78% of 
the protein. For comparison, the program HYDROPRO” was used to estimate the 
sedimentation coefficients of the hexadecamer and octamer from the crystal struc- 
ture. The calculated values were 14.9S and 8.88, respectively. 

SAXS. SAXS was performed at the X12SA beam line, cSAXS, at the Swiss Light 
Source, with the protein at Aggy’ ™ = 7.0 and 3.5, corresponding to 7 and 3.5 mg 
min | in 25mM HEPES/NaOH, 25 mM KCI, pH7.5. Solutions were added to 
2-mm diameter quartz capillaries and kept at 10 °C during measurement. X-rays 
were used at 12.4 keV, and 200 measurements of 0.25 s each were recorded over 10 
positions along the length of the capillary, which was mounted at a detector 
distance of 200 mm. Background measurements with the buffer only were taken 
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using the identical capillaries, positions and measurement protocol. Data were 
used to a maximum momentum transfer of 0.4 A” '. Scattering curves calculated 
for the octameric structure by using CRYSOL” did not agree with the measured 
data, whereas calculated scattering curves for the hexadecamer were in better 
agreement with the measurements, although there remained a notable discrepancy 
at 0.08 A” '. However, fitting a linear combination of the theoretical curves for the 
octamer and hexadecamer using the program OLIGOMER" resulted in excellent 
correspondence after the fitting procedure converged to volume fractions of 
0.8-0.9 for the hexadecamer and 0.1-0.2 for the octamer, depending on the sample. 
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Online education 


Internet-based degree programmes are gaining acceptance, 
but doubts remain about their suitability for graduate science. 


BY SARAH KELLOGG 


hen Lauren McBryde Gray 
began her graduate studies at 
the University of North Carolina 


Wilmington, she was a chemist working at 
a biopharmaceutical company on the east 
coast of the United States, happily married 
and thinking about starting a family. Her busy 
schedule couldn’t accommodate the regimen- 
tation of on-campus courses. “It was impor- 
tant I bring in an income for my family, but I 
wanted to get my master’s to give me an edge 
in terms of my career,’ says Gray. “I needed 
another option, a better option.” 

That option was the university’s hybrid 
online master’s degree in chemistry. The 


innovative Corporate MS Chemistry Program 
offers online courses to professionals working 
at partner companies. Students learn online 
and complete compatible research in labora- 
tories at their workplace, supervised by two 
university professors and a senior colleague 
from the company who has a PhD. “It really 
turned out to be win-win,’ says Gray. 

For Gray, studying online was the perfect 
solution. That's not always the case, however. 
The number of online science graduate-degree 
programmes is slowly rising in the United 
States and other countries, including the 
United Kingdom and Australia (see go.nature. 
com/9yefal for a sample). But they have not 
kept pace with the growth of online degrees in 
fields such as computer technology, education 


and business. Studying science through dis- 
tance learning can be problematic because 
it provides few or no opportunities for col- 
laboration with peers or intensive laboratory 
research. But in fields in which these aspects 
are less crucial, studying science online at the 
graduate level has become accepted and use- 
ful for those seeking a flexible schedule. And 
some hybrid programmes offer the hands-on 
experience that scientific studies require. 


A POPULAR ALTERNATIVE 

Online learning eases pressure from burgeon- 
ing university enrolment, limited campus 
facilities and the increasing cost of higher 
education; it feeds interest in life-long learning; 
and it is convenient, especially given advances 
in web-based communications in the past 
decade. “I do not predict the demise of the 
residential campus,” says Michael Lambert, 
executive director of the Distance Education 
and Training Councilin Washington DC, a US 
accreditation agency for online learning. But 
campuses are at capacity in many respects, he 
adds. “We don't have enough seats or dorms,” 
says Lambert, “and colleges and universities 
and students and their parents cannot afford 
for residential campuses to grow much larger” 

According to the US Department of Educa- 
tion, 20% of all undergraduates in the United 
States — some 4.3 million students — took at 
least one distance-learning course in 2007-08, 
the latest year for which data are available. 
About 800,000, or 4% of all undergraduates, 
took their entire programme through distance 
education. 

The number of online graduate students 
is smaller. About 22% of all US students with 
bachelor’s degrees — 760,000 of them — took 
distance-education courses in 2007-08. Of 
that number, some 302,000 took their entire 
programme online. More than 23,000 students 
took life-sciences courses online, and 4.2% of 
all life-sciences students took their entire pro- 
gramme through distance learning. 

Other countries’ institutions continue to 
expand offerings. The government and uni- 
versities in Britain, for example, have been 
particularly aggressive in their efforts to woo 
overseas students to university distance-learn- 
ing programmes. According to the United 
Kingdom Council for International Student 
Affairs, some 115,000 international students 
use distance learning to study at UK higher- 
education institutions. 

Universities have struggled to design online 
programmes for science and engineering. > 
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> “Itisn’t simple to create a programme that 
meets all the requirements” for scientific grad- 
uate work, says Kenneth Green, founder and 
director of the Campus Computing Project in 
Encino, California, the largest ongoing study 
of the role of information technology in US 
colleges and universities. 

Essential features of an on-campus science 
degree that are hard to replicate online include 
teamwork and laboratory-based research. These 
are a particular problem for online PhD pro- 
grammes. Arthur Motta, a nuclear engineer at 
Pennsylvania State University (Penn State) in 
University Park, teaches online and campus- 
based courses to graduates. He believes that the 
online model is inadequate for PhD research, in 
terms of both learning and financing. 

“Online definitely works for certain master’s 
programmes, but it’s not good in my view for 
the PhD,’ says Motta. For example, princi- 
pal investigators typically receive grants that 
include student support in the form of tuition 
fees and stipends. That support comes with 
expectations — that PhD students conduct 
research projects on campus under the super- 
vision of their principal investigator. “I'm their 
supervisor,” says Motta. “They have tasks to 
perform on the project.” 


UNIVERSITIES EXPERIMENTING 

Despite the challenges, many universities 
are experimenting with online master’s pro- 
grammes. Faculty members say that online 
learning-management systems such as Black- 
board and Moodle, to which teachers can 
upload their course 
content and recom- 
mended reading 
lists, translate on- 
campus curricula 
into effective online 
courses. For exam- 
ple, a teacher would 
use the same lectures, 
slides, problem sets, 
quizzes and tests for 
an online graduate 


physics course as for “We created a 
an on-campus one. way for them 
At some universities, to get throug: h 
online materials for @graduate 
distance learners are Programme 
also made availableto and do some 
on-campus students, significant 

to allow them to research 
watch lectures again work for their 


or to review slides 
that they might have 
missed in class. 
Universities have developed inventive solu- 
tions to research concerns. Students can come 
to the campus for brief residencies at the begin- 
ning, middle or end of their studies, work at 
partner institutions such as a national labora- 
tory or another university, or do research in 
their workplaces. “We created a way for them to 


companies.” 
Jimmy Reeves 


get through a graduate programme and do some 
significant research work for their companies,” 
says Jimmy Reeves, chair of the department of 
chemistry and biochemistry at the University of 
North Carolina Wilmington and an instructor 
for the corporate master’s degree. 

Online students can also do research 
through remote and virtual laboratories. 
Virtual labs, used widely in undergradu- 
ate courses, are expanding into master’s 
programmes. They allow students to com- 
plete coursework by conducting simulated 
experiments in fully interactive animations 
of biomedical, chemical and physics labs. For 
example, students can combine chemicals 
and check whether the results are in line with 
their predictions. Alternatively, students can 
conduct original research projects over the 
Internet by remotely controlling instruments 
in distant laboratory facilities. As universities 
become comfortable with new technologies 
and different learning models, they will be less 
reluctant to offer hard sciences through dis- 
tance learning, argues Frits Pannekoek, presi- 
dent of the International Council for Open and 
Distance Education, a membership organiza- 
tion based in Oslo, and president of Athabasca 
University, a distance-learning centre in Can- 
ada. Critics, however, say that campus visits 
and virtual labs cannot offset the loss of men- 
toring opportunities, especially when students 
are completing a thesis or dissertation. 


OPPORTUNITIES FOR STUDENTS 

Faculty members say that online programmes 
are especially effective for older students who 
have the discipline to complete their work with 
little supervision and are assertive enough to 
seek help when they need it. Because course 
materials are always available, students can 
access materials whenever necessary. Some 
critics argue that online learning offers too 
many distractions, but distance learners say 
that physical classrooms are no guarantee of 
attention or attendance, and online courses 
often allow for a more focused experience. 

“You basically have the ability to come home 
from work and unwind and put on a lecture 
while you cook dinner,’ says Michael Mierzwa, 
a nuclear engineer with Westinghouse Elec- 
tric Company in Pittsburgh, Pennsylvania. 
Mierzwa was attracted to Penn State’s degree 
in nuclear engineering because it was entirely 
online. “It’s hard to work all day and then come 
home and study, but these programmes give 
you the wherewithal to pound through it and 
get on with your career,’ he says. 

Online programmes let students tailor their 
degrees to their needs. At Penn State, Mierzwa 
could choose between a master’s of science or 
a professional science master’s degree (PSM). 
The standard master’s required research time 
on campus or in the workplace, whereas the 
PSM, which does not require original research, 
allowed Mierzwa to work entirely online. He is 
more interested in the business side of nuclear 
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engineering than in research, so he chose the 
PSM. “It allowed me to drill down more, find 
more depth,’ says Mierzwa. 

Although there are many reputable online 
science programmes, some distance-learn- 
ing institutions are questionable, merely 
places where students pay a fee and receive a 
degree. Most universities with online courses 
and degrees that are 
accredited promote 
that fact, and prospec- 
tive students should 
do their research 
before registering 
for any online pro- 


gramme. 
Many online degree 

programmes are 

indeed of high quality. 

a : A 2009 report from 
Online the US Department of 
definitely works — Education conducted 
for certain a meta-analysis of 
master’s 99 existing studies 
programmes, of all types of online 
butit’snot good _ instruction, and con- 
in my view for cluded that in terms 
the PhD.” of learning outcomes 
Arthur Motta and programme 


design, undergradu- 
ate and graduate courses compared favourably 
with on-campus study across a range of disci- 
plines, including the sciences. 

Moreover, employers do not seem to be 
biased against online degrees. A 2010 survey by 
Eduventures, a US higher-education research 
and consulting organization based in Boston, 
Massachusetts, found that alumni of various 
online programmes saw improvements in 
their salaries and titles after acquiring an online 
degree. “These findings seem to refute the pop- 
ular media perception that employers favour 
traditional face-to-face degrees rather than 
online degrees,’ says Kamelia Valkova Turcotte, 
a senior analyst at Eduventures. Many employ- 
ers are now partnering with universities to assist 
their employees’ online degree programmes by 
providing on-site laboratory space. Observers 
expect more science programmes to offer online 
master’s degrees in the coming years, as they 
respond to economic and enrolment pressures. 

Beyond the research and laboratory con- 
siderations, the success of online graduate sci- 
ence education ultimately depends — as does 
the success of on-campus learning — on the 
students and their commitment to a course 
of study. “It’s all about teaching and learning,” 
says John Bourne, executive director of the 
Sloan Consortium in Newburyport, Massa- 
chusetts, a US research and advocacy group 
that studies distance learning. “It’s not about 
whether it’s online or on the ground. You can 
have good or bad experiences in both areas.” m 


Sarah Kellogg is a freelance writer in 
Washington DC. 
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The athletes of science 


Some researchers may recoil from metrics-based assessment. But they should know 
how others are keeping score, say Maria Pavlou and Eleftherios P. Diamandis. 


most impressive and memorable statistics. 

They might have the highest points-per- 
game average for a season in basketball (30.1; 
Michael Jordan), the most major titles in tennis 
(16; Roger Federer) or the most goals scored 
in a single football season (53; Lionel Messi): 
these numbers define their careers. 

Scientists have plenty of their own statistics. 
Performance metrics of various sorts are popu- 
lar, in some cases increasingly so. And, although 
metrics are flawed, scientists should be aware of 
how they are used and how they can be bent to 
aid careers. Conventional indices such as refer- 
ence letters and CVs are neither redundant nor 
passé, but it would be a mistake for any scientist 
to disregard quantitative bibliometric indices. 

The idea of devising quantitative measures to 
rank scientists’ performance originated in the 
early twentieth century. But it was Eugene Gar- 
field, founder of the Science Citation Index, who 
pioneered bibliometrics and showed the value 
of tracking citations. He found, for example, that 
Nobel laureates publish five times the number of 
papers of most researchers, and that their work 
is cited 30-50 times more often’. Such revela- 
tions helped to popularize metrics. Among the 
measures in use today are the journal impact 
factor, which tracks the average number of cita- 
tions of articles in a journal, and the h-index, 
which combines an author's productivity with 
the citation frequency of his or her papers. 


IE sport, the best players publicize their 


MIXED FEELINGS 

Surveys of department heads, researchers and 
administrators reveal a variety of views on the 
use of such metrics”. Some use them frequently; 
others all but ignore them. Metrics are taken 
into account, to varying degrees, in decisions on 
hiring, awarding tenure or promotions, adjust- 
ing salaries and allocating resources. A highly 
cited paper in an intermediate-impact jour- 
nal (impact factor 5-10) may be viewed more 
favourably by a hiring committee than a poorly 
cited paper in a top-impact journal. 

Metrics have some powerful advantages: 
they are objective and quantitative. But they 
also suffer from some major deficiencies that, 
if not taken into account, can make them 
misleading. For example, citation counts and 
other metrics typically assign equal credit 
to all authors of a collaborative paper. So it 
is not unusual to find technology specialists 
(often middle authors who make technical 


contributions) who have more citations than 
some department chiefs. One of our own 
technicians has 1,734 lifetime citations and an 
h-index of 26 (that is, 26 papers with at least 26 
citations each) — scores comparable to those 
ofa 50-year-old university professor. Technol- 
ogists are certainly important, but there should 
be some way to distinguish the different levels 
of contribution made by different individuals. 

An author with a few highly cited papers can 
garner many citations, despite never pursuing 
a career or even a degree in science. For exam- 


ple, one of the papers reporting the original 
methodology for the polymerase chain reac- 
tion was published’ in 1987 with two authors: 
Kary Mullis (Nobel Prize in Chemistry, 1993) 
and Fred Faloona, a supporting staff member 
with five papers (all with Mullis) and no pub- 
lications since 1992. Faloona has more than 
10,000 lifetime citations from just two papers. 
And then there's the ‘bystander effect’: some- 
times highly cited individuals are average, but 
not extraordinary, researchers who happen to 
have collaborated with distinguished scientists. 

Still, metrics can supplement a CV. Jorge 
Hirsch, h-index’ inventor anda physicist at the 
University of California, San Diego, suggests 
that the index could help guide tenure deci- 
sions at universities and membership decisions 
at professional societies. For example, for physi- 
cists, an h value of about 18 could be the thresh- 
old for a full professorship; 15-20 could mean 
a fellowship in the American Physical Society; 
and 45 or higher could mean membership 


of the US National Academy of Sciences. 

Every year, one of us (E.P.D.) prepares a 
bibliometric-analysis booklet as a companion 
to his CV. The booklet includes graphics show- 
ing how many publications he has had each year, 
the number of publications per journal impact 
factor, career citations by citing year, h-index, 
most highly cited papers, field rankings and the 
international rankings of his laboratory. But to 
underscore the limitations of such analyses, the 
cover page includes a disclaimer, adapted from 
a saying by Albert Einstein: “Many of the things 
you can count, don’t count. Many of the things 
you can't count, do count” 


KNOW THE ENEMY 

Young scientists should familiarize themselves 
with the various indices, and prepare their own 
analyses of their scientific output. Understand- 
ably, they will start with low metrics scores, but 
their portfolios will grow during their careers, as 
will most of their metrics and indices. And eval- 
uators should certainly take age into account. 

No single index, formula or description will 
capture the diverse contributions of scientists 
to society. Scientists are involved not only in 
discoveries and publications, but also in teach- 
ing, mentoring, organizing scientific meetings, 
serving on editorial boards and lecturing. But, 
Einstein's caveat notwithstanding, bibliometric 
analyses are here to stay. And as long as their 
shortcomings are taken into account, they can 
be valuable, allowing observers to draw conclu- 
sions about a scientist’s productivity, quality of 
research and impact in science. 

Like professional athletes, young scientists 
should focus on their performance, their CVs 
and their relationships with advisers and col- 
leagues — but they should also be aware of 
their metrics, and how they can improve the 
statistics. Otherwise, they might not measure 
up to the competition. = 


Maria Pavlou is a PhD student in laboratory 
medicine and pathobiology, and Eleftherios P. 
Diamandis is professor of laboratory medicine 
and pathobiology, at the University of Toronto, 
Ontario, Canada. 
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YOU, IN EMULATION 


BY KATHRYN CRAMER 


due back in two weeks for synchroni- 

zation, but I kept you out much longer, 
running up huge fines. The librarian was 
very nice and didn't make me pay right away, 
but said that she had very little discretion; 
that the fines were set by the library system 
and your publisher. 

Iam a writer and I was looking for an act- 
ing teacher to help me improve how I read 
my work out loud. Although, of course, 
your publisher didn’t tell me your real name, 
your bio on the package really spoke to me. 
I thought we would get along, and we did, 
from the very first moment that you were 
uploaded into my card slot. 

Suddenly, there you were. It surprised me 
that you were my height. I’m not sure if that 
is an artefact of the software: virtual teachers 
scaled to the same size as their students, or 
whether you really are (or were) five foot six. 

I was ready to work. I began to read aloud 
from the draft of my novel, and I could tell 
before you even said anything that the nar- 
rative voice just didn’t work. She wasn't in 
the story, but aloof, above it all. This moment 
felt almost like telepathy, but I imagine it was 
accomplished by transmissions from the 
software of the emotional colouring of what 
you were about to say. I had written the story 
in the third person. But somehow with you 
sitting there listening to my voice as I read, 
[had to ask, “Who is this third person, any- 
way?” She is me, of course, but you looked 
at me with your blue eyes over your glasses, 
and I knew that was no answer. 

The next day, you tried another angle. You 
had me sit down on the brown couch and 
told me to pretend I didn’t know who I was 
or where I was. After I suggested a few solu- 
tions such as looking at my driver's licence 
to find out, I asked: “How does this scene not 
end in the emergency room?” Afterwards, I 
remembered that in that bio of yours I liked 
so much, a sort of artist’s statement, you said, 
“The question “Where am I?’ is my preoccu- 
pation.’ You didn't remind me. Instead, you 
told me that youd had an ischaemic event a 
while back that had left you unable to speak 
for a couple of days. 

Your paintings hung on the walls of our 
virtual space. Quiet, expectant landscapes 
and abstracts involving brightly coloured 
rectangles. You said you couldn't paint 
any more. In between sessions, I sent you 
long e-mails, and you would reply in only 


I checked you out of the library. You were 


Brought to book. 


a sentence or two if you 
replied at all. You said 
you couldn't write much 
these days, even though you used 
to write whole books. The word 
agraphia came to mind. 

At the two-week mark, what you were 
trying to teach me snapped into focus and 
I began to hear the narrative voice and to 
write and write and write, and when I read 
the words out loud, they were beautiful and 
pure and often when I read, you seemed on 
the brink of tears. The fines began to mount. 

I wrote and wrote, and in between I 
worked out how to help, how to give you 
back writing and painting. First I write a 
sentence; you're next. 

First I make a brush stroke, then you do. 
I took you hiking in the virtual woods and 
brought body paints. I told you to paint the 
sunset on my back as a sketch, and when 
we got back to our usual virtual space, you 
painted the sunset on canvas. 

I would have kept you longer, except 
for the pain. Implants require synchroni- 

zation and constant 


> NATURE.COM upgrades. To keep you 
FollowFutureson § from being returned 
Facebook at: to the library, I had 
go.nature.com/mtoodm © to stay away from the 
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synchronization stations. 
After a few days, reminders in 
powder-blue letters began to swim 
across my vision. After a few more 
days, a physical sensation came with 
them. After a month, the reminder notes were 
accompanied by excruciating pain because 
I was in violation of the licence agreement. 

I negotiated the fines, pointing out that 
I had solved your writing and painting 
blocks. The publisher acknowledged that 
those blocks were a known issue with the 
product, but said that my work was unneces- 
sary, that in the new upgrade, the product no 
longer had the desire to write or paint. And 
whatever changes there were in my copy 
would merely be averaged in and so won't 
have much net effect. 

Iasked to be put in contact with the native 
version, with the real person. The publisher 
said that this was contractually prohibited, 
but impossible in any case as you died of a 
stroke a few days after you were recorded. 

I went to the library again this morning 
and checked you out. We've got two weeks. = 


Kathryn Cramer lives in Westport, New 
York. She is a science-fiction editor and 
writer currently editing the Year's Best SF 
series with her husband, David G. Hartwell. 
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Few inputs can reprogram biological networks 


ARISING FROM Y. Liu, J. Slotine & A. Barabasi Nature 473, 167-173 (2011) 


Liu, Slotine and Barabasi! identify subsets U of nodes in complex 
networks, which are required to exert full control of these networks. 
Control in this context means that for each possible state S of the 
network there exist inputs for all nodes in U, which are sufficient to 
force the network to state S'. Application of the methodology to gene 
regulatory networks suggests that roughly 80% of all nodes must be 
controlled to drive such a network. This seems to contradict recent 
empirical findings** in the cellular reprogramming field. 

Mammalian cells have been shown to be efficiently reprogrammable 
towards other cellular phenotypes”. Controllability of such complex 
transitions in transcriptional networks underlying cellular phenotypes 
appears to be an intrinsic biological property and is being used for 
the development of novel disease models and cellular therapeutics in 
regenerative medicine. However, in contrast to technical or social 
networks, biological networks show a high degree of intrinsic co- 
regulation. Co-regulation appears to quench the admissible space of 
states of gene expression networks to a combinatorial expression space 
with relatively low dimensionality. Recent publications suggest that this 
subspace is spanned by only a few (five to fifteen) genome-wide differ- 
ential expression patterns, which are—surprisingly—sufficient to char- 
acterize most observable biological phenotypes” ’. Each of these patterns 
is given by weighted sums over the expression of relatively extensive 
gene sets contributing coherently to the respective pattern’’. Although 
these empirical studies suggest linear structures underlying the respec- 
tive subspaces, nonlinear or star-shaped topologies cannot be excluded. 

Moreover, genome-wide co-regulation may result in dynamic res- 
ponse features, as observed in chemical networks. Hence, owing to 
their low dimensionality, full control and reprogramming of biological 
systems may be achieved by only a few key control factors, seemingly 
contradicting the more general concept put forward by ref. 1. Recent 
experimental studies show proof of concept for a wide range of repro- 
gramming of biological systems using overexpression of only one to 
five transcription factors” °'°, which effectively regulate only a subset 
of downstream genes. We note that the number of nodes (five or 
fewer genes out of about 30,000) needed to fully control a biological 
system’ °”° is much less than the estimate of 80% of all nodes’. 

Thus, for the special case of biological systems, it may be sufficient to 
weaken the standard definition of controllability of networks—control 
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of large sets of nodes as suggested by ref. 1—to mean the controllability 
of restricted, biologically admissible network states. 
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REPLYING TO F.-J. Miller & A. Schuppert Nature 478, doi:10.1038/nature10543 (2011) 


Miller and Schuppert’ describe an exception to our finding” that 
roughly 80% of the nodes must be controlled to gain full control over 
gene regulatory networks. Yet our result hides subtleties that reveal as 
much about controllability as about the limits of our current under- 
standing of biological networks. 

Reprogramming cells from one phenotype towards another is not 
the same as full control of the regulatory process. Controllability, as 
explored by us, concerns the ability to drive the system from any point 
to any other point in the state space in a finite time. Stable phenotypes 
are best seen as attractors of the system’s regulatory dynamics’, sepa- 
rated by potential barriers that are difficult to cross under normal 
circumstances*’, Reprogramming therefore concerns our ability to 
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drive the system from one attractor to another—that is, controllability 
within a restricted subspace of the original state space. Consequently, 
such a procedure may require fewer driver nodes. 

This can be illustrated by the example of a directed star with one 
central hub and N — 1 leaf nodes, capturing the common situation in 
which the expression of N — 1 genes is co-regulated by a single tran- 
scription factor. To gain full control of the system’s dynamics we need 
to control Np = N — 1 nodes, that is, all but a single node. (Here, Np 
denotes the number of driver nodes and N is the total number of 
nodes in the network.) Yet, if we do not care about the relative differ- 
ences in the expression pattern of the leaf nodes, then it is sufficient to 
control the central hub, that is, Np = 1. This does not imply full 
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Figure 1 | Controlling the central hub of a directed star will cause the system 
to get stuck in a particular subspace. In the left panel, the control input 1, 
(represented by the blue wavy line) is applied to the central hub x, of the 
directed star with N = 3 nodes. The system will be stuck in a plane described by 
3)X2(t) = azxX3(t), as shaded in the right panel. The system is controllable 
within this subspace, that is, it can be driven from any initial state (blue point) to 
any desired final state (green point) in finite time. However, it is not 
controllable in the whole state space. 


control, because the system is stuck in a subspace of the full state 
space (see Fig. 1), but such limited control may be sufficient for 
reprogramming. 

Our estimate that approximately 80% of nodes must be controlled 
to fully control the gene regulatory network is based on very in- 
complete maps of regulatory networks. Current estimates indicate 
that the protein-protein interaction maps cover less than 10% of all 
potential protein-protein interactions”. For regulatory networks we 
lack such estimates, but probably the coverage is even lower. 
Assuming that with advances in mapping the number of regulatory 
links will increase tenfold, our analytical results (equations (4) and (5) 
in ref. 2) predict that mp (=Np/N) will drop to (0.8)'° ~ 11%. Hence, 
the observed large np is partially driven by the incompleteness of 
regulatory maps. Furthermore, the presence of gene auto-regulation 
will create self-edges (that is, nodes pointing to themselves), making 
Mp drop even further (M. Posfai, Y.-Y.L., J.-J.S. and A.-L.B., 
manuscript in preparation). Finally, mp could be further reduced by 
nonlinearities in the dynamics, which would potentially help the 


system explore its state space, enhancing its controllability and thus 
decreasing np (refs 8 and 9). 

Thus, controllability offers an appropriate framework within which 
to explore many cellular processes, potentially including cell re- 
programming. To reach its full potential, however, we need a better 
understanding of both the structure and the dynamics of regulatory 
processes. 
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RNAi promotes heterochromatic silencing through 
replication-coupled release of RNA Pol II 


Mikel Zaratiegui't, Stephane E. Castel'’?, Danielle V. Irvine'+, Anna Kloc'}, Jie Ren!, Fei Li*+, Elisa de Castro*, Laura Marin’, 
An-Yun Chang", Derek Goto'+, W. Zacheus Cande’, Francisco Antequera’, Benoit Arcangioli®® & Robert A. Martienssen!? 


Heterochromatin comprises tightly compacted repetitive regions 
of eukaryotic chromosomes. The inheritance of heterochromatin 
through mitosis requires RNA interference (RNAi), which guides 
histone modification’ during the DNA replication phase of the cell 
cycle’. Here we show that the alternating arrangement of origins of 
replication and non-coding RNA in pericentromeric heterochro- 
matin results in competition between transcription and replication 
in Schizosaccharomyces pombe. Co-transcriptional RNAi releases 
RNA polymerase II (Pol II), allowing completion of DNA replica- 
tion by the leading strand DNA polymerase, and associated histone 
modifying enzymes’ that spread heterochromatin with the replica- 
tion fork. In the absence of RNAi, stalled forks are repaired by 
homologous recombination without histone modification. 

In fission yeast, the Rik1/CLRC (recombination in K, cryptic locus 
regulator) complex silences heterochromatin via Clr4 and Lid2, which 
methylate histone H3 lysine 9 (H3K9) and demethylate histone H3 
lysine 4 (H3K4), respectively’. This complex is recruited in part by 
RNA interference, which processes non-coding transcripts found in 
the pericentromeric heterochromatin’*. Interactions between the 
RNA-induced transcriptional silencing (RITS) complex and CLRC 
have recently been found*’, but spreading of the Rikl complex into 
reporter genes depends on the catalytic activity of RNAi, and the 
mechanism remains unknown’. Recently, we found that Cdc20 and 
Mms19 interact with Rikl and are required for histone modification’. 
Cdc20 is the catalytic subunit of the leading strand DNA polymerase 
Pol, whereas Mms19 is a regulatory subunit of the Pol II transcription 
factor TFIIH. Both proteins participate in transcription coupled nuc- 
leotide excision repair (TC-NER) which depends on damage-stalled 
Pol II to detect structural lesions in the DNA which are repaired by the 
Pol ¢ after Pol I release’. 

The pericentromeric heterochromatin of fission yeast comprises 
outermost (otr) repeats called dg (5 kilobases) and dh (1-6 kb), flanked 
by innermost (imr) repeats (~6kb) containing clusters of transfer 
RNA genes (Fig. la). Histone H3 lysine-9 methylation is associated 
with dg and dh repeats (Fig. 1b), but ends abruptly at the tRNA clusters, 
and so is confined to heterochromatin’. The dg and dh repeats are 
transcribed by RNA polymerase II"°, and processed into siRNA clusters 
up to 4.5kb in length (Fig. 1b). To investigate the extent of siRNA 
precursor transcripts, we first cloned and sequenced dh and dg repeat 
complementary DNA from dcr1A mutants (Fig. 1c). Polyadenylation 
sites were then identified using rapid amplification of cDNA ends-PCR 
(RACE-PCR; Methods), and sequencing showed they were located 
within the clusters of short interfering RNA (siRNA; Fig. 1c). In pre- 
vious studies of dcr14 mutants, Pol II enrichment was detected by 
chromatin immunoprecipitation (ChIP)'’, whereas transcriptional 
run-on (TRO) analysis indicated over-accumulation of forward (but 
not reverse) transcripts’. We found that these Pol II ChIP (cen-dg) 
and TRO probes lie downstream of ‘forward’ polyA sites (Fig. 1d), 


indicating inefficient termination and Pol II readthrough in the absence 
of RNAi. To confirm readthrough, northern blots of polyadenylated 
and total RNA from dcr1A, agolA and rdp1A mutants were probed 
with strand-specific probes. Transcripts corresponding to full-length 
dh (1.3 kb) and dg (1.3-2.3 kb) cDNA clones were enriched in polyA+ 
RNA, as expected, but much longer readthrough transcripts up to 
4.5 kb could also be detected (Supplementary Fig. 1), indicating that 
polyadenylation was highly inefficient at these internal sites. 

Inefficient polyadenylation is a strong indication of failure to release 
RNA polymerase II’, and we proposed that slicing’ and dicing’ of 
nascent transcripts via RNAi promotes 3’-5’ degradation by the 
exosome’ and release of RNA polymerase II from the 3’ end’. The 
exosome is required for silencing, consistent with this idea'*'’. To 
examine Pol II release, we performed ChIP-seq with Pol II antibodies, 
and found peaks of both poised (S5 phosphorylated) and elongating 
forms (S2 phosphorylated) of Pol II in dcr14 mutants that corre- 
sponded to the polyadenylation sites on each strand (Fig. 1b, c). 
Peaks of siRNA accumulation mapped just downstream. Thus siRNA 
in wild-type cells accumulated where Pol II was released (Fig. 1b). 

siRNA accumulate during S phase’ and we found that siRNA clusters 
ended abruptly at the replication origin homology regions contained 
within each repeat’® (Fig. 1b). To assess the influence of DNA replica- 
tion on Pol II accumulation we blocked replication in high concentra- 
tions of hydroxyurea and performed ChIP-seq using Pol II antibodies. 
In arrested dcr14 mutants, Pol II accumulated throughout the otr 
repeats, but in dividing dcr1A cells, Pol II accumulation was absent 
from replication origins (Fig. 2a and data not shown). To test if Pol II 
was expelled by replication fork progression (Fig. 2b), hydroxyurea- 
arrested dcr1A cells were released into the cell cycle (Fig. 2c). As pre- 
dicted, accumulation at replication origins was quickly lost, and Pol II 
was only found between origins, closer to promoters’®, in each sub- 
sequent S phase. 

Failure to release RNA polymerase II during S phase is a strong 
and robust signal for DNA damage*. To monitor DNA repair, the 
hydroxyurea-arrested cells contained a Rad22 protein fused with 
yellow fluorescent protein (Rad22-YFP). Rad22 (Rad52 in budding 
yeast) is essential for homologous recombination and is associated with 
single-stranded DNA ends early during DNA repair. Chromatin 
immunoprecipitation showed that Rad22"***? was weakly associated 
with heterochromatic origins in wild-type cells arrested with hydro- 
xyurea, but quickly declined following release (Fig. 2c). In dcr1A 
mutants, on the other hand, Rad22R@452 peaked early in each successive 
S phase, indicating engagement of the repair machinery during hetero- 
chromatin replication’. To exclude the impact of hydroxyurea arrest 
on DNA damage, we also examined Rad22-YFP accumulation in 
untreated wild-type and dcr1A mutant cells by fluorescence micro- 
scopy (Supplementary Fig. 2). The results were consistent with chro- 
matin immunoprecipitation, in that six times as many dcr1/ than 
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Figure 1 | Transcription and replication of pericentromeric 
heterochromatin in fission yeast. a, Pericentromeric heterochromatin on 
centromere 3. dh (red), dg (green) and imr (magenta) repeats are shown, 
bordered by tRNA genes (brown). Replication origins (yellow) are found in 
each repeat. b, Tiling microarrays of K9me2 ChIP (light blue) and clusters of 
small RNA sequences (dark blue) from wild-type cells. ChIP-seq reads 
corresponding to poised (S5-P) and elongating (S2-P) RNA polymerase II 


wild-type cells had Rad22***? foci during septation (early $ phase). 
Therefore, Dcr1 activity prevents DNA damage and the engagement of 
homologous recombination at the centromere. 

We performed genetic tests to determine the role of RNAi in pre- 
venting DNA damage during S phase. DNA damage during replication 
can be rescued by homologous recombination repair, and we found 
that double mutants in the RecA homologue rhp51"*“"! and der1A or 
agolA were inviable or formed microcolonies (Fig. 2d). A similar 
requirement for Rhp51***°! has been demonstrated for convergent 
stalled replication forks’*, which are protected from collapse in fission 
yeast by a stable replication-pausing complex comprising Swil/Swi3 
and Mrcl (mediator of replication checkpoint 1)’°. Low concentra- 
tions of hydroxyurea stall replication forks, and we found that whereas 
dcr1A, ago1A and rdp1A cells were insensitive, double mutants with 
swi3A or mrc1A were very sensitive to low concentrations of hydro- 
xyurea (Supplementary Fig. 3). Similar results were obtained with 
camptothecin which causes arrest during S phase when the replication 
fork encounters the camptothecin-topoisomerase I complex. In gen- 
ome-wide epistasis tests, mutants in more than 30 genes, mostly 
encoding proteins involved in DNA repair and histone modification, 
interacted significantly with both mrcl1 and dcr1, forming a striking 
genetic network (Supplementary Table 1). This indicates that loss of 
Decr1 activity engages replication fork protection. 

To assess fork integrity, we examined replication of the repeats by 
two-dimensional gel electrophoresis using probes from the ura4 
transgene, which was inserted into a passively replicated dg repeat 
on chromosome 1 (Fig. 2b). In wild-type cells, we detected strong X 
intermediates, indicative of joint molecules, as well as the expected fork 
or Y molecules (Fig. 3a). Similar X-DNA sister chromatid junctions 
arise at origins*’ but also at stalled replication forks”. These 
X-molecules were unaffected in dcr14 (Fig. 3b) but reduced in 
mms19A, in swi6A and especially in clr4A cells (Fig. 3c-e). Both 
Mms19 and Clr4 interact with Rikl, and Mms19 participates in 
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enriched in dcr1A cells relative to wild-type (WT) cells are in black. c, CDNA 
clones (beige) from dcr1A cells. PolyA sites are indicated as vertical lines and 
correspond to peaks of Pol II. Arrows indicate the direction of ‘Forward’ 
transcription. d, Alignment of probes used in previous studies indicates that 
regions enriched for Pol II’’ (cen-dg) and transcriptional run-on probes’ (TRO) 
lie downstream of forward orientation polyA sites. 


transcription initiation’. Swi6 on the other hand is required to initiate 
replication within heterochromatic repeats’’, and recruitment 
depends on Clr4. Thus simultaneous replication and transcription of 
heterochromatic repeats promote local replication fork stalling. 

In wild-type cells (Fig. 4a), modified histones recruit Swi6 and the 
Rikl complex via chromo- and other domains. Swié promotes early 
replication, and the Rikl complex interacts with DNA polymerase ¢, 
which allows spreading of histone modification along with fork pro- 
gression’. Flanking tRNA genes (Fig. la) pause replication”, prevent- 
ing further spreading into neighbouring euchromatin’”’. Transcription 
during S phase stalls the replication fork, accounting for interactions 
between the replication and transcription machineries’, but RNAi 
releases Pol II, allowing replication to proceed. In the absence of 
RNAi (Fig. 4b), Pol II remains stalled at replication forks and signals 
DNA repair by homologous recombination, which restarts blocked 
forks™. The Rikl complex is lost along with the replisome, preventing 
spreading of heterochromatin into reporter genes, which lose H3K9 
methylation entirely. Recombination also removes modified histones 
from at least one of the two daughter chromatids” reducing, but not 
eliminating, methylation of the repeats as previously observed’. 

We tested this model in several ways. First, we predicted that the 
interaction between the Rikl complex and Pol ¢ should depend on 
RNAi, and we found that co-immunoprecipitation of Cdc20/Pol ¢ 
with Dos2/Clr7 was reduced in dcr1/ cells, along with H3K9me2 
(Supplementary Fig. 4). Second, we observed that mutants in the 
cyclin-dependent Pol II C-terminal domain kinase cdk9 display slow 
growth and loss of pericentromeric silencing and siRNA (Supplemen- 
tary Fig. 5). Cdk9 is a central regulator of transcription elongation 
that links cell-cycle-regulated pre-messenger RNA processing, co- 
transcriptional histone methylation and DNA damage’*. Finally, 
Clr4 has recently been found to have additional roles in recruiting 
the RITS complex to accessory Pol II factors’, providing a potential 
mechanism for Pol II release by RNAi. We found long transcripts 
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Figure 2 | RNA interference and DNA replication restrict RNA polymerase 
II accumulation and prevent DNA damage. a, Small RNA (blue) and Pol II 
ChIP-seq reads (black) and regions of significant enrichment (blue and red 
rectangles) from wild type and dcr1A on the right arm of centromere 1. b, A 
replication bubble is shown, initiated at one of the three origin homology 
regions at centromere 1 (yellow boxes). c, Chromatin immunoprecipitation for 
RNA Pol IT and Rad22"*“°? from hydroxyurea-arrested and released wild-type 
(dashed lines) and dcr14A (solid lines). Cell cycle progression after release from 
hydroxyurea block is monitored by septation index, which peaks coincident 
with S phase. d, Representative parental and non-parental di-type tetrads from 
crosses between rhp51A cells, defective in homologous recombination, and 
dcr1A or ago1A. 


indicative of strong transcriptional readthrough in clr44 mutant cells 
consistent with this model (Supplementary Fig. 1). 

In the budding yeast Saccharomyces cerevisiae, the Dicer-related 
RNase III Rntl releases Pol II during transcription termination’ 
whereas in Escherichia coli, failure of transcription termination stalls 
replication forks and triggers recombination”, providing a precedent 
for the mechanism we propose. According to this mechanism, 
transcription during S phase triggers histone modification, so long 
as RNA polymerase is released by RNAi, and not by homologous 
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Figure 3 | Replication fork stalling during heterochromatin replication. 
Replication intermediates in wild-type and mutant cells resolved by 2D gel 
electrophoresis and probed with the unique DS/E probe from the ura4 
transgene within the dg repeat on chromosome 1 (Fig. 2a). a, Schematic of 
replication intermediates in 2D gels indicates joint molecules (X), and forks (Y). 
b-f, Junction molecules indicate fork stalling in WT (b) and dcr1A mutant cells 
(c), and are reduced in mms19A (d), swi6A (e) and clr4A (f). 


recombination repair. In plants, fungi and invertebrates, heterochro- 
matic silencing may involve similar mechanisms (Supplementary 
Table 1), whereas in mammals, both X inactivation and imprinting 
require transcription of non-coding RNA in dividing cells*®. In each 
case, release of Pol II during S phase, by RNAi or by other means, could 
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Figure 4 | Replication-coupled transcriptional silencing through histone 
modification and RNAi. a, The Rikl complex (red hexagon) is recruited to 
heterochromatic replication forks by interactions with methylated histone 
H3K9me2 and with the leading strand DNA polymerase (Pol ¢, green). Swi6 
induces origin firing, but collision with RNA polymerase II (orange) stalls 
replication forks. RNAi releases Pol II by processing of pre-siRNA transcripts 
(red lines), allowing leading strand DNA polymerase to complete DNA 
replication and the associated Rikl histone modification complex (red 
hexagon) to spread histone modification (black circles). b, In the absence of 
RNAi, origins fire but Pol II is not released, stalling replication forks. Stalled Pol 
II signals repair via homologous recombination instead. Recombination could 
in principle occur with sister chromatids (shown here) or with other copies of 
the same repeat (not shown). DNA polymerase ¢ and the associated Rik1 
complex are lost along with the replisome, and fail to spread histone 
modification into neighbouring reporter genes. 
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allow fork restart and spreading of histone modification in a similar 
way. 


METHODS SUMMARY 


Non-coding transcripts were cloned from a cDNA phage library by hybridization 
to dh and dg consensus probes. Cloning and high throughput sequencing of sRNA 
was performed using an Illumina Genome Analyzer according to manufacturer’s 
instructions. Two-dimensional gel electrophoresis of replication intermediates 
from steady state cultures was performed with probes to the otr1::ura4™ insertion. 
For ChIP experiments, cultures were arrested in 15 mM hydroxyurea for 4.5h, 
released and collected at indicated times, to be crosslinked and processed for 
chromatin immunoprecipitation. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Yeast strains and methods. S. pombe strains used in this study are listed in 
Supplementary Table 2. Standard media and genetic protocols for fission yeast 
were used*'. Crosses between dcr1, agol and rhp51 deletion mutant strains were 
performed with strains of h*,h~ and smt-0 mating types, with comparable results. 
Lethality or sickness was observed in 90% of the double mutants as well as many 
single mutants, suggesting epigenetic effects inherited through meiosis. 

cDNA analysis. Total RNA was extracted from cells growing in YES by the hot 
phenol method. In brief, mid-log phase (0.5 attenuance at 600 nm) yeast cultures 
were incubated in SDS-acetate buffer (50 mM sodium acetate, 10 mM EDTA, 1% 
SDS pH5), with an equal volume of phenol pH 4.3 at 65 °C for 15 min, vortexing 
every 2 min for 30s. After centrifugation, the aqueous phase was extracted once 
with acid phenol, once with acid phenol:chloroform 1:1 and precipitated by addi- 
tion of 1/10 volume of 3 M sodium acetate pH 8 and 2 volumes of ethanol, resus- 
pended and treated with a DNA-free kit (Ambion). cDNA was reverse-transcribed 
from total RNA using oligodT adapters, and cloned into phage vectors as recom- 
mended by the supplier (Stratagene). More than 60 clones were purified by blot 
hybridization to dg and dh probes, and sequenced from both ends. Polyadenylation 
sites were identified by a string of T residues longer than the adaptor followed by 
matches to the S. pombe consensus sequence. Alignment to the genome sequence 
was possible because of single nucleotide polymorphisms between copies of the 
repeats, with the exception of chromosome 3, where the tandemly arranged dg and 
dh repeats are identical. Sequences containing a long stretch of A residues imme- 
diately downstream of known centromeric sequence were considered to represent 
the polyadenylated 3’ ends of centromeric transcripts. Polyadenylation sites of 
centromeric cDNA sequences from wild-type and rdr1A strains were determined 
using the ligation-mediated polyA test (LM-PAT) assay as previously described’? 
with the following message-specific primers: dglIIrevPolC, dglIIIrevPolD, 
dglIIIforPolC, dhIforPolA and dhIrevPolA. The PCR products amplified from 
LM-PAT cDNA were separated on an agarose gel, individually extracted and 
cloned into the pCR2.1-TOPO vector (Invitrogen) according to the manufacturer’s 
instructions. Cloned products were then sequenced in both directions. See oligo- 
nucleotide list for details (Supplementary Table 3). 

Northern blot analysis. Total RNA was extracted from cells growing in YES by the 
hot phenol method. PolyA* RNA was isolated from wild-type and mutant strains 
using Dynabeads Oligo(dT) (Invitrogen). Northern blot analysis was performed 
using NorthernMax-Gly (Ambion). Blots were probed for the dh transcripts 
(p30_F_T7 and p30_R_T3), dg transcripts (p33_F_T7 and p33_R_T3) and actin 
(act1F and actl1R_T7). See oligonucleotide list (Supplementary Table 3) for details. 
Small RNA library sequencing. Total RNA was extracted from cells growing at 
exponential phase in YES using hot phenol extraction. Small RNAs were isolated 
using flashPAGE followed by gel isolation. Small RNA libraries for Solexa/ 
Illumina sequencing were prepared according to the manufacturers’ instructions. 
Libraries were sequenced on an Illumina Genome Analyzer. The resulting 
sequence reads, including quality scores, were aligned to the S. pombe reference 
sequence with MAQ (http://maq.sourceforge.net/) using the ‘map’ function. 
Two-dimensional gel electrophoresis. Two-dimensional gel electrophoresis of 
centromeric DNA was performed as previously described’. DNA pellets were 
digested with Ncol/HindIII to generate restriction fragments of 3.7 kb for origin 
2055 and 4.0 kb for the otr::ura4* transgene at the centromere. The two restriction 
fragments were analysed using a probe to origin 2055 (ref. 20) and a probe that 
recognizes the ura4-DS/E minigene deleted region. See oligonucleotide list for 
details (Supplementary Table 3). 

Hydroxyurea synchronization and chromatin immunoprecipitation. 
Synchronization of log-phase rad22-yfp and rad22-yfp/Adcr1 S. pombe strains 
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was performed as previously described’. In brief, cells were treated with 15 mM 
hydroxyurea (Sigma) for 4.5h to synchronize in S phase. Cells were released by 
washing twice in hydroxyurea-free media and grown for 5h taking 50 ml samples 
every 30 min. Samples were fixed with 3% paraformaldehyde for ChIP as previ- 
ously described’. Chromatin was purified, sonicated and incubated with either 
anti-RNA Polymerase II (clone 83WG16, Upstate) or anti-green fluorescent protein 
(Roche) antibodies. Synchronization efficiency was estimated using the septation 
index. For chromatin immunoprecipitation quantification, immunoprecipitated 
DNA was amplified by quantitative PCR (qPCR) in duplicate. The following 
formula was used: enrichment =2~ (Cites ~ Chaat )IP (Chest — Circ WC) Where Citest 
and Ci, t; is the amplification cycle in which the signal surpassed the established 
threshold for the test and reference amplicons respectively, in the immunopreci- 
pitated (IP) and input DNA (wce) samples. The test regions were amplified with 
the following primers; p30F_qPCR and p30R_qPCR (dh region), p33F-qPCR and 
p33R_qPCR (dg region), and p20F and p20R (2055 origin of replication), and for 
the reference, actlIF_qPCR and actl1R_qPCR (act1). See oligo list for details 
(Supplementary Table 3). 

Chromatin immunoprecipitation sequencing and data analysis. Samples from 
log-phase DG21 and DG690 S. pombe strains were fixed (dividing cells) or first 
treated with 15 mM hydroxyurea (Sigma) for 4.5 h, washed once in hydroxyurea- 
free media and fixed (hydroxyurea-arrested cells)’. Chromatin was purified, 
sonicated and incubated with following anti-RNA polymerase II antibodies from 
Abcam: clone pospho-S2 (ab5095), and clone phospho-S5 (ab5131). For ChIP- 
seq, immunoprecipitated DNA was polished, ligated and amplified with the 
Illumina ChIP-seq sample preparation kit according to the manufacturer’s 
instructions, and sequenced in an Ilumina Hi-Seq 2000 Analyzer by 50 bp paired 
end sequencing as previously described*’. Analysis was carried out by a custom 
pipeline using Bowtie to align reads to the S. pombe genome and MACS 1.4 for 
significance enrichment calls**. Wiggle fragment pileup files were generated for 
each comparison to visualize the difference in Pol II enrichment between control 
(DG21) and treatment (DG690) immunoprecipitations at 10-bp intervals genome 
wide. The difference at each interval was calculated by first normalizing to the total 
number of reads between the two by scaling up, and then subtracting the normalized 
control value from the treatment. Detailed sequence statistics are provided in 
Supplementary Table 4. 

Microscopy. Samples were analysed using differential interference contrast 
(DIC) microscopy and fluorescence microscopy (Axioimager upright fluorescent 
microscope). 

Survival assays. Mid log-phase cultures were resuspended to 2 X 10° cells per ml 
and serially diluted tenfold. Dilutions were spotted on YES agar plates or YES agar 
containing the indicated amounts of hydroxyurea (Sigma), camptothecin (Sigma) 
or methyl methanesulphonate (MMS) (Sigma). Recovery was for 3 days at 30 °C. 
Co-immunoprecipitation. Cells were lysed by glass beads in HB buffer*’. Lysates 
were pre-cleared with protein A agarose beads, followed by 2-h incubation with 
antibodies against the haemagglutinin tag (Sigma, E6779) at 4°C. After washing, 
eluted proteins and input extracts were analysed by western blotting using antibodies 
against TAP (Sigma, P 2026). 
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Structural basis for a [4Fe-3S] cluster in the oxygen- 
tolerant membrane-bound | NiFe|-hydrogenase 


Yasuhito Shomura!’’, Ki-Seok Yoon*+, Hirofumi Nishihara? & Yoshiki Higuchi'** 


Membrane-bound respiratory [NiFe]-hydrogenase (MBH), a H,- 
uptake enzyme found in the periplasmic space of bacteria, catalyses 
the oxidation of dihydrogen: H,— 2H* + 2e7 (ref. 1). In contrast 
to the well-studied O,-sensitive [NiFe]-hydrogenases (referred to 
as the standard enzymes), MBH has an O,-tolerant H, oxidation 
activity” *; however, the mechanism of O, tolerance is unclear’. 
Here we report the crystal structures of Hydrogenovibrio marinus 
MBH in three different redox conditions at resolutions between 
1.18 and 1.32 A. We find that the proximal iron-sulphur (Fe-S) 
cluster of MBH has a [4Fe-3S] structure coordinated by six cysteine 
residues—in contrast to the [4Fe-4S] cubane structure coordinated 
by four cysteine residues found in the proximal Fe-S cluster of the 
standard enzymes—and that an amide nitrogen of the polypeptide 
backbone is deprotonated and additionally coordinates the cluster 
when chemically oxidized, thus stabilizing the superoxidized state 
of the cluster. The structure of MBH is very similar to that of the 
O,-sensitive standard enzymes except for the proximal Fe-S cluster. 
Our results give a reasonable explanation why the O, tolerance 
of MBH is attributable to the unique proximal Fe-S cluster; we 
propose that the cluster is not only a component of the electron 
transfer for the catalytic cycle, but that it also donates two electrons 
and one proton crucial for the appropriate reduction of O, in pre- 
venting the formation of an unready, inactive state of the enzyme. 

An increasing interest in exploring H, as a clean energy carrier has 
encouraged the use of hydrogenases as efficient catalysts for H, oxida- 
tion in fuel cells and for Hz production. However, a substantial obstacle 
is the inhibition or inactivation of hydrogenases by Oo. For this reason, 
elucidating the molecular basis of the O2 tolerance of MBH is crucial, 
and has implications for biotechnological development in this area. 

The standard [NiFe]-hydrogenase and MBH are phylogenetically 
related heterodimeric enzymes, sharing ~45% amino acid sequence 
identity (Supplementary Table 1 and Supplementary Fig. la, b). 
Whereas the former is mostly found in microorganisms living in strictly 
anaerobic environments, the latter is found as a component of the 
aerobic respiratory system in some bacteria®*. The heterodimeric unit 
of MBH associates with a membrane-anchored b-type cytochrome sub- 
unit (Cyt b) and the inner membrane through a carboxy-terminal trans- 
membrane helix in the small subunit, which is absent in the standard 
enzyme. The MBH-Cyt b complex catalyses the oxidation of H, at the 
periplasmic space to release protons and the reduction of quinones in 
the inner membrane to take up protons from the cytoplasmic side, so as 
to generate a proton gradient across the membrane. 

The overall structure of the heterodimeric unit of MBH is similar to 
those of the standard enzymes” (Supplementary Table 1), but we 
observed a dimer of heterodimers in an asymmetric unit of MBH 
crystals in which the transmembrane helices had been spontaneously 
cleaved off during purification (Fig. 1). In contrast to the heterodimeric 
structure of all of the standard enzymes reported so far'’, we consider 
the heterotetrameric structure of MBH to be biologically relevant for 


the following reasons: the contact area between the protomers 
occupies as much as 11% of the total surface area; and the C termini 
of the small subunits of the crystalline enzyme (which had been located 
near the Cyt b subunits in vivo) and the distal Fe-S clusters that transfer 
electrons to Cyt b face the same side of the complex. 

Our structural analyses have shown that the proximal Fe-S cluster of 
MBH is not a [4Fe-4S] type as in the standard enzyme, but a [4Fe-3S] 
type, where one of the corner sulphides is missing and two additional 
cysteine thiolates coordinate iron atoms (Fig. 2a—-d). The proposal 
made for MBH of Ralstonia eutropha H16 (based on the sequence 
alignment and spectroscopic analyses) that the proximal Fe-S cluster 
has a structure and electronic properties distinct from those of the 
standard enzyme did not conflict with the X-ray structure elucidated 
in this study. Specifically, Sy of Cys 25° (where superscript S (or L) 
denotes that the residue is in the small (or large) subunit) replaces a 
sulphide that would have bound Fel, Fe2 and Fe4 in the [4Fe-4S] 
cubane cluster to coordinate Fel and Fe2, and the Sy of Cys 158° 
coordinates Fe4 (Fig. 2a, b). Consequently, a bond between a sulphur 
and an iron atom in the standard [4Fe-4S] cluster is missing, but all 
four iron atoms are coordinated by four ligands. Despite the difference 
between the cluster species, the main-chain trace (that is, the Ca posi- 
tions) around the proximal cluster of MBH is very similar to that of the 
standard enzyme (Fig. 3). 

A structural comparison between different redox conditions 
revealed redox-dependent structural changes in the proximal Fe-S 
cluster. In the ferricyanide-oxidized condition, the amide nitrogen of 
Cys 26° replaced the $3 ligand of Fe2, and the cluster became more 
asymmetric (Fig. 2c, d). Hereafter, the structures of the [4Fe-3S] 
clusters in the H,-reduced and the ferricyanide-oxidized conditions 
will be termed the ‘symmetric’ and ‘asymmetric’ forms, respectively. 
Notably, the structures of the two forms were virtually identical except 
for the position of Fe2. The amide nitrogen in the asymmetric form is 
considered to be deprotonated, because the distance between the 
amide nitrogen and Fe? is 2.09 A, and C=O and N atoms in the amide 
and Fe2 are nearly coplanar. The deprotonation of the amide indicates 
that there is an additional negative charge at this amide/carbonyl, 
which should stabilize a higher oxidation state of the iron atoms in 
the asymmetric form, as described below. 

The electron density map of the cluster in the air-oxidized condition 
revealed that the crystals contain both the symmetric and asymmetric 
forms (Supplementary Fig. 2). A considerable body of X-ray diffraction 
data and the structural refinement of different crystals obtained under 
identical air-oxidized conditions confirmed that the ratio of the sym- 
metric form to the asymmetric form (that is, Fe2:Fe2' in Supplemen- 
tary Fig. 2) remained between 3:2 and 2:3. Full occupation of the 
asymmetric form has been observed only when an efficient oxidant 
such as ferricyanide is used. The symmetric form was recovered on 
reduction with H, or titanium(1m) citrate, or by excessive exposure to 
X-rays on the air-oxidized crystals. A redox-dependent conformational 
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[8Fe-4S] (medial) 


Figure 1 | Overall structure of MBH heterotetramer. The three structures in 
different oxidation conditions (Supplementary Table 2) are nearly identical to 
each other, therefore only the structure for the H-reduced condition is shown. 
Two large subunits (labelled L and L’) and two small subunits (S and S’) are 
shown with different colours. One of the two heterodimers is represented by 
surface models with different colours. In the left panel (front view), a pseudo 


change in an Fe-S cluster was also reported for the P-cluster in the 
nitrogenase from Azotobacter vinelandii’’, a [4Fe-4S] cluster in the 
NADH-quinone oxidoreductase from Thermus thermophilus’, and 
hybrid cluster proteins from Desulfovibrio vulgaris’, of which the sig- 
nificance on their reaction mechanisms was pointed out. As for the 
coordination of the peptide amide nitrogen to the iron atom, this is the 
third reported example for metalloproteins’>”*. 

The net charge of the [4Fe-3S]-6Cys in MBH established in this 
study is equal to that of the [4Fe-4S]-4Cys in the standard enzymes, 
assuming that the oxidation states of the iron atoms are identical. In 
addition, the numbers and positions of the charged residues around 
the proximal clusters of MBH in the symmetric form and the standard 
enzymes are similar except for the presence of two additional acidic 
residues in MBH (Fig. 3): Asp 51°, the carboxy group of which locates 
~12A from the edge of the proximal cluster, and Glu73", which 
neutralizes nearby Arg 74”. The fact that Glu 73" is not strictly con- 
served among MBH (Supplementary Fig. 1b) indicates that the residue 
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Figure 2 | Structural comparison of the proximal cluster in different redox 
conditions. a, b, H,-reduced condition. c, d, Ferricyanide-oxidized condition. 
Meshes represent F, — F, maps contoured at 3.00 where non-protein atoms 

were omitted in the calculation of F.. The numbers shown in b and d represent 
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dyad in the dimer of heterodimers is indicated; the C terminus of the small 
subunit (S) and the metal clusters in one of the heterodimers (L’ and S’) are 
labelled. The Fe-S clusters are termed proximal, medial and distal clusters with 
regard to their respective distance from the Ni-Fe active site. In the right panel 
(side view), the contact areas between the heterodimers are indicated with 
different colours to show which subunit is involved in the association. 


has little effect on the electronic property of the proximal cluster. 
Taken together, the redox transition of 4+/3+ is most probable for 
the proximal [4Fe-3S] cluster in the symmetric form during the H)- 
oxidation catalytic cycle; the oxidation states of the iron atoms are 
2Fe**:2Fe** and 3Fe**:1Fe** as in standard [4Fe-4S] clusters. This 
conclusion does not conflict with the results of electron paramagnetic 
resonance studies on MBH from different bacteria’® *!—the midpoint 
potentials of the proximal cluster in the higher diamagnetic and the 
lower paramagnetic states of R. eutropha H16 MBH and Aquifex aeo- 
licus MBH were reported to be -60 and +87mV, respectively. 
Furthermore, a predicted superoxidized paramagnetic cluster for R. 
eutropha H16 and A. aeolicus should correspond to the asymmetric 
form observed in this study based on the measured midpoint poten- 
tials of +160 and +232 mV, respectively’’”. The superoxidation of 
the proximal cluster to [4Fe-3S]°* should require the conformational 
change and concomitant donation of an additional negative charge 
described above. Previously proposed two single-electron transitions 
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the bond distances (A). The blue line indicates a bond that is newly formed by 
oxidation; the dashed red line indicates a bond that is found in the reduced form 
but is broken in the oxidized form. 
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Figure 3 | A comparison of charged residues around the proximal Fe-S 
cluster in MBH and the standard [NiFe]-hydrogenase from D. gigas. 
Carbon atoms of MBH and the standard enzyme are shown in yellow and cyan, 
respectively. All charged residues shown are conserved between two 
hydrogenases, except Asp 51° and Glu 73", which are only found in MBH. 
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of the proximal cluster in MBH (that is, [4Fe-3S]?*/4*/°*)" can thus 
be reasonably explained by the redox-dependent conformational 
change observed in this study. 

After catalytic oxidation of H, at the [NiFe] active site buried deep in 
the protein, two protons have to be transferred to the solvent through a 
putative proton-transfer pathway(s). Although the details of the 
proton-transfer mechanism remain to be elucidated even for the 
standard enzymes, an initial proton-transfer step involving the proto- 
nation of Cys 530" (residue numbering is for the standard enzyme 
from Desulfovibrio gigas?) and the subsequent proton transfer to 
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Figure 4 | Proposed proton-transfer pathways. a, The standard [NiFe]- 
hydrogenase from D. gigas (Protein Data Bank accession 1QY9). b, MBH in this 
study. The two proton-transfer pathways downstream of Glu 18" (Glu 28" in 
MBH) proposed in the standard enzymes are indicated with grey arrows in 
a. The carbon atoms of the large and small subunits are shown in cyan and pink, 
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Glu 18" is widely accepted’ (Fig. 4a). This is based on the higher 
temperature factors of the two functional groups of Cys 530" and Glu 
18" compared with those of the nearby atoms!’, and on mutagenic 
studies using Glu25"Gln and Glu25" Asp variants (Glu 25" corresponds 
to Glu 18" in the D. gigas enzyme) of Desulfovibrio fructosovorans”®. 
Two subsequent proton-transfer pathways that follow the protonation 
of Glu 18" were proposed for the standard enzyme'**: one via the 
residues around the proximal Fe-S cluster and the other via the 
C-terminal histidine (His 536"), the carboxy group of which coordi- 
nates the magnesium ion (Fig. 4a). The proton transfer from Cys 590" 
(Cys 530" in D. gigas) to Glu 28" (Glu 18" in D. gigas) is also likely to 
occur in MBH based on its structural similarity to the standard enzymes 
around the active site, but the alternative pathway to the C-terminal 
His 596" is unlikely, where Met 52" of MBH disturbs the array of water 
molecules that was identified in the standard enzymes (Fig. 4a, b). It 
seems that Met 52” by itself does not cause the structural difference, but 
the nearby Leu 21° affects the position of Met 52" by steric hindrance. 
Actually, Leu 21° is conserved only in MBH, whereas Met 52" is found 
in [NiFeSe]-hydrogenases and a H)-sensing [NiFe]-hydrogenase 
(Supplementary Fig. la, b). On the other hand, the pathway via the 
residues around the proximal Fe-S cluster has been identified in MBH 
(Fig. 4b). Notably, this pathway has a connection to the amide nitrogen 
that coordinates Fe2 in the asymmetric form of the proximal Fe-S 
cluster, indicating that the observed redox-dependent structural 
change in the proximal cluster is directly coupled to the redox states 
and reactions at the Ni-Fe active site, and that one proton liberated 
from the amide-hydrogen can be transferred to the active site to reduce 
Os, as described below. 

The standard enzyme is known to be inactivated reversibly to form 
two spectroscopically distinguishable states, Ni-A and Ni-B (ref. 25). 
Whereas Ni-A is preferentially produced by the oxidation of the active 
site with O, under electron-deficient conditions and requires con- 
siderable time for the reductive reactivation, Ni-B is formed in 
electron-rich conditions and is readily reactivated in a few seconds”. 
It has been suggested that preventing the Ni-Fe cluster from being 
trapped in the inactive Ni-A state is crucial for MBH to function in 
the presence of O) (refs 4, 22, 27). Therefore, having a higher redox 
potential and a two-electron donation property in the proximal cluster 
is advantageous in preventing the formation of Ni-A, as previously 
pointed out'’””. The structure of the Ni-Fe active site of MBH shows 
close similarity to those of the standard enzymes (Supplementary 
Fig. 3a, b)'*, excluding the possibility that the O, tolerance of MBH 


Mg 
His 596 


[4Fe-3S] 


respectively. The hydrogen bonds are represented by dotted grey lines. The red 
spheres are water molecules identified in the crystal structures. For His 19° 
(His 13° in D. gigas), the inversion of the imidazole ring is taken into account for 
the formation of hydrogen bonds. His 596" (His 536" in D. gigas) is the 
C-terminal residue of the large subunit. 
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is ascribable to the structural difference in the active site as suggested 
for [NiFeSe]-hydrogenase**. Furthermore, our structural analysis has 
confirmed that the physical exclusion of O, by the H, channel, pro- 
posed for H>-sensing [NiFe]-hydrogenases””*, is not the case for 
MBH, because the size and shape of the channels of MBH and the 
standard enzyme of D. vulgaris MF showed no particular difference 
that would affect the diffusion of O2 (Supplementary Fig. 4a, b). 
Finally, a recent study” using an MBH mutant that lacks the additional 
two cysteine residues in the proximal cluster has shown that the cluster 
is important for O, tolerance both in vivo and in vitro. Our results 
provide a structural basis that underlies the unprecedented function of 
the [4Fe-3S] cluster in the O, tolerance of MBH, which acquires a 
superoxidized state to supply two electrons and one proton for the 
reduction of O, by using a redox-dependent conformational change. 


METHODS SUMMARY 


MBH crystals were prepared anaerobically by mixing 2 tl of the protein solution 
consisting of 27.5 mg ml * MBH, 10 mM Tris-HCl (pH 7.4) and 2 mM DTT with 
2 ul of the reservoir solution including 100 mM PIPES (pH6.5), 15% PEG3350, 
300 mM Li,SO, and 5 mM DTT. To prepare the H2-reduced crystals, fresh crystals 
were soaked in buffer containing 100 mM PIPES (pH 6.5), 20% PEG3350, 300 mM 
Li,SO,, 20% glycerol and 1mM methyl viologen in a screw-capped vial with a 
silicon septum, and the crystals were reduced under 100 kPa of H, for 2 hat 313 K. 
The ferricyanide-oxidized crystals were produced by soaking the fresh crystals in 
buffer containing 100 mM PIPES (pH 6.5), 20% PEG3350, 300 mM Li,SO,, 20% 
glycerol and 5mM K,[Fe(CN)¢] for 1h at room temperature (between 293 and 
298 K) inside a glove box. The air-oxidized crystals were prepared by soaking the 
fresh crystals in the same buffer as for Hj-reduced crystals but without methyl 
viologen for 1h at room temperature outside the glove box. All crystals were 
cooled with liquid nitrogen inside the glove box with the exception of the air- 
oxidized crystals. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Production and preparation of crystals. The procedures for bacterial cultivation, 
purification and crystallization of MBH, and the preliminary crystallographic 
experiment have been described previously”’. Crystallization was performed inside 
an anaerobic glove box (Model F1024, Thermo Fisher Scientific) by the sitting- 
drop vapour diffusion method at room temperature. The inner solution was pre- 
pared by mixing 2 ul of protein solution consisting of 27.5 mg ml~' MBH, 10 mM 
Tris-HCl (pH7.4), and 2mM DTT with 21 of reservoir solution including 
100mM PIPES (pH6.5), 15% PEG3350, 300mM Li,SO, and 5mM DTT. 
Dark-brown crystals grew to maximum size (~0.2 X 0.2 X 0.1mm) within a 
few days. To prepare the H2-reduced crystals, fresh crystals were soaked in buffer 
containing 100 mM PIPES (pH 6.5), 20% PEG3350, 300 mM Li,SO,, 20% glycerol 
and 1 mM methyl viologen in a screw-capped vial with silicon septum. The reduc- 
tion of crystals was performed under 1 bar of H, for 2 h at 313 K, and was optically 
confirmed by the blue colour of reduced methyl viologen. The ferricyanide-oxi- 
dized crystals were produced by soaking the fresh crystals in buffer containing 
100 mM PIPES (pH 6.5), 20% PEG3350, 300 mM Li,SO4, 20% glycerol and 5 mM 
K,[Fe(CN).] for 1h at room temperature inside the glove box. The air-oxidized 
crystals were prepared by soaking the fresh crystals in air-saturated buffer contain- 
ing 100 mM PIPES (pH 6.5), 20% PEG3350, 300 mM Li,SO, and 20% glycerol for 
1h at room temperature outside the glove box. All crystals were cooled with liquid 
nitrogen inside the glove box with the exception of the air-oxidized crystals. 

X-ray diffraction data collection and structural refinement. The X-ray diffrac- 
tion data were collected at beamline BL41XU and BL44XU of SPring-8 (Hyogo, 
Japan) using Beamline Schedule Software (BSS)**. The crystals were maintained at 
90 K using a gaseous nitrogen stream during data collection. The diffraction data 
were integrated and scaled with the program HKL2000 (ref. 33). The structure was 
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determined by the molecular replacement method using the program MolRep 
from the CCP4 suite** with atomic coordinates of the enzyme from D. vulgaris 
ME (Protein Data Bank accession 1WUJ) as a search model. Refinement with 
isotropic B-factors for individual atoms was performed with the program Refmac5 
(ref. 35), and subsequent refinement with anisotropic B-factors and alternative 
conformations were carried out with the program SHELX-97 (ref. 36) without 
merging diffractions in Bijvoet pairs. For tentatively assigned oxygen atoms 
around cysteine thiolates in the [NiFe] active centre and Fe2 in the proximal 
Fe-S cluster, occupancy and B-factor values were alternately refined. In the final 
step of each refinement, hydrogen atoms were included. Visual inspection and 
manual correction of models were performed with the program COOT”. Data 
collection and refinement statistics are summarized in Supplementary Table 2. 
Figures were prepared with using the program PyMol (DeLano Scientific). 
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The crystal structure of an oxygen-tolerant 
hydrogenase uncovers a novel iron-sulphur centre 


Johannes Fritsch’, Patrick Scheerer*, Stefan Frielingsdorf', Sebastian Kroschinsky’, Barbel Friedrich’, Oliver Lenz! 


& Christian M. T. Spahn?* 


Hydrogenases are abundant enzymes that catalyse the reversible 
interconversion of H, into protons and electrons at high rates’. 
Those hydrogenases maintaining their activity in the presence of 
O, are considered to be central to H2-based technologies, such as 
enzymatic fuel cells and for light-driven H, production’. Despite 
comprehensive genetic, biochemical, electrochemical and spectro- 
scopic investigations**, the molecular background allowing a 
structural interpretation of how the catalytic centre is protected 
from irreversible inactivation by O, has remained unclear. Here we 
present the crystal structure of an O2-tolerant [NiFe]-hydrogenase 
from the aerobic H2 oxidizer Ralstonia eutropha H16 at 1.5A 
resolution. The heterodimeric enzyme consists of a large subunit 
harbouring the catalytic centre in the H2-reduced state and a small 
subunit containing an electron relay consisting of three different 
iron-sulphur clusters. The cluster proximal to the active site 
displays an unprecedented [4Fe-3S] structure and is coordinated 
by six cysteines. According to the current model, this cofactor 
operates as an electronic switch depending on the nature of the 
gas molecule approaching the active site. It serves as an electron 
acceptor in the course of Hz oxidation and as an electron-delivering 
device upon O, attack at the active site. This dual function is 
supported by the capability of the novel iron-sulphur cluster to 
adopt three redox states at physiological redox potentials” ’. The 
second structural feature is a network of extended water cavities 
that may act as a channel facilitating the removal of water produced 
at the [NiFe] active site. These discoveries will have an impact on 
the design of biological and chemical H-converting catalysts that 
are capable of cycling H, in air. 

More than two billion years ago, ancient microbes exploited the 
reducing power of H, for their metabolism; until today H2 provides 
a valuable energy source which is used by H2-oxidizing uptake hydro- 
genases. The reverse process, that is, the liberation of H2, serves as 
safety valve to eliminate excessive reducing power under anaerobic 
conditions. This proton reduction reaction is catalysed by the group 
of H,-evolving hydrogenases. All hydrogenases use abundant trans- 
ition metals such as nickel and iron for catalysis, contrary to man- 
made H,-converting catalysts that predominantly rely on rare precious 
metals. 

Among three phylogenetically distinct types of hydrogenases, two 
enzyme classes prevail in nature. According to the metal content of 
their active sites they are classified as nickel-iron ([NiFe]) and di-iron 
([FeFe]) hydrogenases"®. [FeFe]-hydrogenases are highly productive in 
H, evolution, but are irreversibly inactivated during catalysis by even 
trace amounts of O, (ref. 11). [NiFe]-hydrogenases, however, function 
usually in the direction of H, oxidation and are less sensitive to Op. In 
most cases, O, reacts with the active site giving rise to a mixture of 
inactive states, denoted as Ni-A and Ni-B, depending on the nature of 
the oxygen ligand bridging the Ni and Fe atoms in the active site’”. 
Both inactive forms, however, can be reactivated under reducing 


conditions. Enzymes in the Ni-B state reactivate rapidly, whereas the 
recalcitrant Ni-A state requires long-term reactivation that may occur 
exclusively in vitro’*'*. Consequently, a prerequisite for a hydrogenase 
to function in vivo in the presence of Oj is the strict avoidance of the 
Ni-A form and a continuous removal of the oxygen species related to 
the Ni-B state. These features are present in a small group of [NiFe]- 
hydrogenases that are designed to operate in mixtures of Hz and O; 
(ref. 15). Knallgas bacterium Ralstonia eutropha H16 harbours at least 
three [NiFe]-hydrogenases capable of oxidizing H2 at atmospheric po,. 
The best-characterized enzyme is the heterodimeric membrane-bound 
[NiFe]-hydrogenase (MBH), which is attached to the periplasmic side 
of the cytoplasmic membrane and feeds the electrons derived from H 
oxidation via a membrane-integral b-type cytochrome directly into the 
respiratory chain (Fig. 1)'®. Recent studies suggested that the iron- 
sulphur (Fe-S) cluster in the proximal position to the [NiFe] active site 
of MBH significantly differs in its electronic structure and function 
from conventional [4Fe-4S] cubanes, which are usually located at 
the corresponding position of O -sensitive [NiFe]-hydrogenases*”. 
Moreover, experimental evidence revealed that this particular cluster 
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Figure 1 | Overall structure of the membrane-bound hydrogenase from R. 
eutropha. The upper-left inset is a cartoon depiction of the cellular localization 
of MBH. Electrons from H, oxidation are transferred through a relay of Fe- 
S clusters via a b-type cytochrome (Cyt b) to the respiratory chain. The ribbon 
representation shows the large (blue) and small (green) subunits of MBH. The 
catalytic centre and the three Fe-S clusters are symbolized as spheres. The 
spatial arrangement of MBH cofactors is illustrated in ball-and-stick 
representation in the lower part of the figure. 
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has a crucial role in the O, tolerance of MBH’. However, all structural 
interpretations of O, tolerance have been made on the basis of crystal 
structures from O,-sensitive [NiFe]-hydrogenases'*!”. 

To solve the crystal structure of MBH, the heterodimeric enzyme 
was purified from the solubilized membrane fraction of R. eutropha 
cells according to an optimized cultivation and purification protocol 
resulting in highly active and homogeneous protein preparations’. 
Dark-brown MBH crystals were harvested under a reducing atmo- 
sphere containing 5% H, and 95% N>. The structure was solved by 
molecular replacement using the structure of the reduced hydrogenase 
of Desulfovibrio vulgaris Miyazaki F (Protein Data Bank accession 
1WUL) as the search template, and the model was refined to a reso- 
lution of 1.5 A. 

The two MBH subunits show the typical overall topology of crystal- 
lized O3-sensitive [NiFe]-hydrogenases (Fig. 1 and Supplementary Figs 
land 2)'*'”°. An initial omit electron density map readily uncovered 
the catalytic centre in the large subunit and the three Fe-S clusters in the 
small subunit (Supplementary Fig. 3). All four cofactors are spaced in 
distances of approximately 10 A that allow electron transfer at physio- 
logically relevant rates (Fig. 1). The bimetallic active site of MBH con- 
sists of a nickel atom coordinated by four cysteines, two of which are 
bridging ligands to the iron atom (Fig. 2a). Furthermore, the iron 
carries three diatomic ligands, one carbonyl (CO) and two cyanide 
(CN) groups. The distance of 2.6A between the two metal atoms 
agrees with the reduced conformation of the active site’. Consistent 
with previous spectroscopic observations*’, the first coordination 
sphere of the MBH catalytic centre is very similar to that of O,-sensitive 
[NiFe]-hydrogenases (Supplementary Fig. 4). This indicates that the 
O, tolerance of MBH does not rely on a significant modification of the 
catalytic centre’. 

Three Fe-S clusters in the MBH small subunit conduct the electron 
flow between the [NiFe]-active site and the cytochrome b (Fig. 1). The 
distal cluster relative to the active site is a cuboidal [4Fe-4S] centre 
coordinated by three cysteines and one histidine (Fig. 2d). It is shielded 


Figure 2 | Metal cofactors of the MBH. The blue meshes represent 2F, — F. 
electron densities contoured at 2.00. a-d, The 2F, — F, electron densities 
perfectly fit with the [(CysS),Ni(1-CysS)2Fe(CN)2(CO)] centre (a), a proximal 
[4Fe-3S] cluster coordinated by six cysteine-derived sulphurs (b), a medial 
[3Fe-4S] cluster coordinated by three cysteine residues (c), and a distal [4Fe-4S] 
cluster coordinated by three cysteine residues and one histidine (d). All 
cofactors are shown in ball-and-stick representation; the coordinating amino 
acid side chains are depicted as stick models. 
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from the solvent by a 3jo-helix representing the first part of the 
carboxy-terminal «-helical extension of HoxK (Fig. 1 and Supplemen- 
tary Fig. 1), which is essential for both anchoring the hydrogenase to 
the membrane and a tight connection to cytochrome b'*. Three 
cysteine residues are involved in coordination of a [3Fe-4S] cluster 
which occupies the medial position, as observed in other [NiFe]- 
hydrogenases'”!*”° (Fig. 2c). The most surprising feature of MBH 
was observed at the position proximal to the active site. Instead of a 
common [4Fe-4S] cluster, a novel type of Fe-S cluster was found 
comprising four iron atoms but only three sulphides (Fig. 2b and 
Supplementary Fig. 3c). The conformation of this unprecedented 
[4Fe-3S] cluster is maintained by four cysteines (Cys 17°, Cys 20°, 
Cys 115°, Cys 149°, where S indicates the small subunit), which are 
conserved in all [NiFe]-hydrogenases, and two additional cysteines 
(Cys 19°, Cys 120°) exclusively found in O,-tolerant enzymes (Sup- 
plementary Fig. 5)**. In fact, Cys 19° and Cys 120° ligate three of the 
four iron atoms resulting in an open distorted conformation of the 
cluster (Fig. 3). In a concerted manner, Cys 120° and Cys 1498 with- 
draw Fe3 from the cuboidal structure resulting in enlarged Fe-Fe dis- 
tances of 3.5 and 4.0 A to Fel and Fe4, respectively (Fig. 3). Notably, the 
typical Fe-Fe distance in [4Fe-4S] and [3Fe-4S] clusters is 2.7 A (refs 
17, 18, 20). The position of the missing sulphide in the [4Fe-3S] cluster, 
compared to a [4Fe-4S] centre, is occupied by the thiolate sulphur of 
Cys 19° (Fig. 3). 

The [4Fe-3S] cluster is clearly distinct from distorted, partially 
damaged clusters found in crystal structures of some standard hydro- 
genases’””°. Moreover, it is unique among Fe-S centres with unusual 
structures (Supplementary Fig. 6). Surprisingly, the MBH proximal 
cluster shares similarity with one half of the reduced P-cluster of 
nitrogenase’. The P-cluster resembles a tandem of two cuboidal 
4Fe-3S modules bridged by a single sulphide’. The resulting [8Fe- 
7S] centre is coordinated by six cysteine residues, two of which serve 
as bridging ligands. Interestingly, one of the two 4Fe-3S modules was 
found in a highly distorted conformation, which is analogous to the 
structure of the [4Fe-3S] cluster in MBH (Fig. 3 and Supplementary 
Fig. 6). 
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Figure 3 | Architecture of the proximal [4Fe-3S] cluster. a, Schematic model 
and structure of the [4Fe-3S] cluster and the corresponding coordinating 
cysteine ligands in the MBH small subunit. b, The proximal [4Fe-4S] cluster of 
the O3-sensitive standard [NiFe]-hydrogenase from D. vulgaris Miyazaki F 
(Protein Data Bank accession 1WUL). In the upper schemes, the sulphides and 
cysteine-derived sulphur atoms are labelled in green and black, respectively. 
The cluster structures are shown in ball-and-stick representation; the 
coordinating amino acid side chains are depicted as stick models. 


©2011 Macmillan Publishers Limited. All rights reserved 


Oxygen tolerance implies that, upon approaching the catalytic cen- 
tre, O2 has to be removed reductively through an immediate delivery of 
four electrons and protons for the complete reduction of O; to water’. 
Because the oxidized active site is blocked and cannot bind Hg, elec- 
trons must be delivered by reverse electron flow**”"*. This feature 
seems to be linked to the previously determined, comparatively high 
redox potentials of the Fe-S clusters in MBH, which range from 
—180 mV to +160 mV. Moreover, the proximal cluster alone appears 
to undergo two redox transitions within an extraordinary narrow 
potential window from —60mV to +160mV (refs 4, 7, 9, 16, 23). 
The unique capability to carry two electrons at the same time at 
physiologically relevant potentials is in perfect agreement with the 
precisely assigned redox transitions mediated by the Fe-S clusters of 
the O,-tolerant, MBH-related hydrogenase I from Aquifex aeolicus’. 
Generally, the potential range between the 3+/2+ and 2+/1+ transi- 
tions of high-potential as well as ferredoxin-type [4Fe-4S] clusters is 
approximately 1,000 mV, placing either of the transitions beyond 
physiological relevance’*”*. However, the corresponding potential 
window of the two redox transitions mediated by the [4Fe-3S] cluster 
is only 220 mV (refs 8, 9, 23). 

Compared to the rather symmetric [4Fe-4S] clusters, the four iron 
atoms of the [4Fe-3S] cluster are coordinated by a higher number of 
cysteine-derived thiolates and a lower number of sulphides (Fig. 3). This 
structural information in combination with the interpretation of data 
from interdisciplinary studies’°>” suggest that three out of four iron 
atoms reside formally in the 3+ state in the most oxidized form of 
the [4Fe-3S] cluster, which has been substantiated by Mossbauer experi- 
ments performed on hydrogenase I from A. aeolicus*. According to the 
current understanding”, the increased thiolate:sulphide ratio elevates 
the redox potentials of all redox transitions mediated by an Fe-S cluster. 
In the case of the [4Fe-3S] cluster, this would explain the high value 
of —60 mV of the low-potential redox transition, but certainly not 
the relatively low value of the high-potential redox transition 
(Em = +160 mV). Conventional high-potential [4Fe-4S] clusters are 
embedded in a hydrophobic pocket. Consequently, the number of 
hydrogen bonds (particularly those from water molecules) to sulphur 
ligands is low, which in turn leads to a high covalency of the Fe-S bonds 
and poises the 3+/2+ transition to a physiological potential range”. 
A conserved water molecule in O3-sensitive hydrogenases is replaced 
by the Cys 120° thiolate sulphur. However, two well-defined water 
molecules (Wat366, Wat447, Supplementary Fig. 3b) were found 
within hydrogen-bonding distance to the [4Fe-3S] cluster. This leaves 
the question open as to how the protein environment tunes the high- 
potential transition of the proximal [4Fe-3S] cluster. Notably, the open 
conformation of the [4Fe-3S] cluster provides enhanced structural 
flexibility permitting redox-dependent rearrangements which have 
been observed, for example, for the P-cluster of nitrogenase” 

As discussed above, H, conversion in the presence of O, implies 
continuous production of H,O at the active site*”"*. Thus, HO needs 
to be continuously removed from the protein core to the surface. It is 
rather unlikely that HO molecules escape through the proposed 
hydrophobic gas channels*’, which are also observed in MBH 
(Supplementary Fig. 7). The structure of MBH uncovered water-filled 
cavities that connect the active site with the solvent (Fig. 4) and are 
absent in the crystal structures of O2-sensitive [NiFe]-hydrogenases 
(Supplementary Fig. 8). The additional cavity close to Cys 81" (where L 
indicates the large subunit) in MBH originates from the replacement of 
a bulky tyrosine, present in most O)-sensitve hydrogenases (Sup- 
plementary Fig. 5), by the small Gly 80” residue. Two gates on opposite 
sides of this water pocket seem to prevent unrestricted water/proton 
flow between the active site and the protein surface. The imidazol 
group of His220", which is conserved in O>-tolerant hydrogenases 
and the presence of which strictly correlates with the occurrence of 
Gly 80", disrupts the direct connection to a water-filled extension of 
the gas channel that reaches the [NiFe] centre (Fig. 4). The proposed 
water transfer to the surface is gated by salt bridges formed between 
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Figure 4 | Proposed water/proton transfer pathway from the MBH active 
site to the protein surface. Relevant water-filled cavities are coloured in blue; 
the water molecules are represented as red spheres. Other cavities including the 
mostly hydrophobic gas channel” are shown in grey. Amino acids in the large 
subunit involved in the proposed water/proton transfer (red arrows) are 
depicted as sticks. Cys 81" and Glu 308" are shown in both side-chain 
conformations (see also Supplementary Fig. 9). For the complete reduction of 
O, to two H30 molecules, four electrons must be delivered rapidly by the 
electron relay of the small subunit (Supplementary Fig. 7). 


Arg 88" and the glutamate residues, Glu 107" and Glu 308" (Fig. 4). 
Remarkably, we identified two rotamers of Glu 308", indicating a tran- 
sient opening of the gate (Supplementary Fig. 9). Thus, the structural 
data are compatible with a controlled translocation of water molecules, 
probably with accompanied proton transfer, supporting the model 
that O,-tolerant hydrogenases form water as a catalytic by-product 
during H; conversion in the presence of Ob. 


METHODS SUMMARY 


Crystallization and structure determination of MBH. The membrane fraction 
containing Strep-tagged MBH was prepared from R. eutropha cells grown hetero- 
trophically under microaerobic, hydrogenase-derepressing conditions’. After 
treatment with K3[Fe(CN).¢], membrane proteins were solubilized with Triton 
X-114, and MBH was purified via Strep-Tactin affinity chromatography. MBH 
crystals were obtained using sitting-drop vapour diffusion. On a microbridge, 
8 ul protein solution containing 0.1 mM MBH were mixed with 8 Ll of reservoir 
solution containing 20-30% PEG3350, and 0.1M _ Bis-(2-hydroxy-ethyl)- 
amino-tris(hydroxymethyl)-methane, pH 5.5-6.5. Crystals of MBH grew to full 
size in the drops within 5 days. The crystals were directly frozen in liquid nitrogen 
for screening and X-ray analysis at the synchrotron ESRF, Grenoble (France). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Media and cell growth conditions. Basic media and growth conditions for 
R. eutropha have been described previously”*’. Ralstonia eutropha HF649 was 
cultivated heterotrophically at 30 °C in a modified FGN mineral medium contain- 
ing 0.04% wt/vol fructose, 0.4% wt/vol glycerol and 40 11M FeC];. 4,000 ml cultures 
were shaken in baffled 5,000 ml Erlenmeyer flasks at 120 r.p.m. under air until they 
reached an optical density at 436 nm of 12 + 1. Cells were harvested by centrifu- 
gation at 6,000g for 15 min at 4°C. 

Isolation of membranes and Strep-Tactin affinity chromatography. Frac- 
tionation and purification steps were performed at 4 °C. Cells were re-suspended 
in buffer (3 ml 65 mM potassium phosphate [K-PO,], 300 mM NaCl, pH 7.0, per 
lg wet weight) containing Complete EDTA-free protease inhibitor cocktail 
(Roche Applied Science) and DNase I. The cell suspension was subsequently 
disrupted in a French pressure cell (SLM Aminco) via two passages at 1,241 bar. 
The resulting crude extract was treated by sonication (Branson Sonifier) and the 
cell debris was removed by low-speed centrifugation (4,000g, 30min). 
K;[Fe(CN).] was added to the crude extract at a final concentration of 50 mM. 
Membrane and soluble fractions were separated by ultracentrifugation (100,000g 
for 60 min). The membrane pellet was washed with re-suspension buffer contain- 
ing Complete protease inhibitor cocktail (Roche Applied Science), followed by 
ultracentrifugation (100,000g for 50 min). Membrane proteins were solubilized in 
10 ml buffer (65 mM K-POx, 300 mM NaCl, 2% wt/vol Triton X-114, Complete 
protease inhibitor cocktail, pH 7.02) per 1 g of membrane pellet by stirring on ice 
for 2h. After ultracentrifugation (100,000g, 45 min), the supernatant containing 
the solubilized membrane extract was loaded onto Strep-Tactin Superflow columns 
(IBA) which were run by gravity flow. To remove unbound proteins, the columns 
were washed with 12 bed volumes of re-suspension buffer and proteins were eluted 
with buffer containing 50 mM K-PO,, 150 mM NaCl, 5 mM desthiobiotin and 10% 
wt/vol glycerol at pH 7. MBH-containing fractions were pooled, concentrated and 
the buffer was exchanged to 40 mM K-PO,, 150mM NaCl, 10% wt/vol glycerol, 
pH5.5, with a centrifugal filter device (Amicon Ultra-15 PL-30, Millipore). Protein 
concentrations were determined with the BCA-kit (Pierce) with bovine serum 
albumin as standard. 

Hydrogenase activity assay. Spectrophotometric activity measurements of 
purified MBH were conducted at 30 °C in a rubber-stoppered cuvette containing 
H;-saturated K-PO, buffer (50mM, pH 5.5) and methylene blue as the electron 
acceptor™. 

Crystallization. MBH was used for crystallization at concentrations up to 14mg 
ml | (7-14mgml '). Crystallization screens by the sparse matrix method** were 
carried out by the sitting-drop vapour diffusion method testing more than 1,000 
crystallization conditions at 277 K and 291 K using 96-well MRC plates. Promising 
conditions were systematically screened further by changing protein concentra- 
tion, pH and the concentration of precipitation agents. Optimized MBH crystals 
could be grown by sitting-drop vapour diffusion method at 282 K using 24-well 
Linbro plates. Each sitting drop was prepared on a microbridge by mixing 
equal volumes (8 il each) of MBH and reservoir solution. The reservoir solution 
contained 20-30% polyethylene glycol 3350, 100mM Bis-(2-hydroxy-ethyl)- 
amino-tris(hydroxymethyl)-methane buffer, pH 5.5-6.5. Dark-brown MBH crys- 
tals appeared within 1-2 days and grew further for 4-5 days and were harvested 
under an atmosphere composed of 5% H2 and 95% N>. MBH crystals were flash 
frozen in liquid nitrogen with (90% (v/v) reservoir solution and 10% (w/v) poly- 
ethylene glycol 400) and without further cryoprotection. Fully grown crystals 
had dimensions of approximately 1.4 X 0.3 X 0.3mm’. Dissolved MBH crystals 
exhibited the same H2 oxidation activity of 130 mol H; per min per mg of protein 
as the protein before crystallization, demonstrating that the crystallization process 
preserved the catalytic activity of the enzyme. 

Structure analysis. Diffraction data collection was performed at 100K using 
synchrotron X-ray sources at ESRF, Grenoble, France, and BESSY II, Berlin, 
Germany. Best diffraction data were collected at beamline ID14-4 at ESRF, at 
2 = 0.9395 A. The crystal to ADSC Q315r detector distance was fixed at 
127.8mm for MBH. The rotation increment for each frame was 0.5° with an 
exposure time of 1s. To reduce significantly the radiation damage on the crystal 
we used the helical data collection protocol at beamline ID14-4"*. All images were 
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indexed, integrated and scaled using the XDS program package” and CCP4 
program SCALA**”’. Crystals belong to orthorhombic space group P2,2;2; 
(a = 73.09 A, b= 95.65 A, c=119.15 A, a= ff =y = 90°). Supplementary Table 
1 summarizes the statistics for crystallographic data collection and structural 
refinement. 

Initial phases for MBH in the H,-reduced state were obtained by conventional 
molecular replacement protocol (rotation, translation, rigid body fitting) using the 
[NiFe]-hydrogenase structure of Desulfovibrio vulgaris (Protein Data Bank accession 
1WUL) as the initial search model. Molecular replacement was achieved using the 
CCP4 program PHASER*”™ by first placing the MBH heterodimer (rotation func- 
tion [RFZ]: Z = 21.7; translation function [TFZ]: Z = 36.2 for MBH; RFZ and TFZ 
as defined by PHASER). In subsequent steps, torsion angle molecular dynamics, 
simulated annealing using a slow-cooling protocol and a maximum likelihood target 
function, energy minimization, and B-factor refinement by the program CNS” were 
carried out in the resolution range 74.6-1.5 A. After the first round of refinement, all 
[Fe,,-S,] clusters and [NiFe]-active site were clearly visible in the electron density of 
both oA-weighted 2F, — F- maps, as well as in the cA-weighted simulated anneal- 
ing omitted density maps (Supplementary Fig. 3). Restrained, individual B-factors 
were refined and the crystal structure was finalized by the CCP4 program REFMACS5 
and CCP4*”° and PHENIX". The final model has agreement factors Rgee and Reryst 
of 15.2% and 13.9%. Manual rebuilding of the MBH model and electron density 
interpretation was performed after each refinement cycle using the program 
COOT™. Structure validation was performed with the programs PROCHECK” 
and WHAT_CHECK™. Potential hydrogen bonds and van der Waals contacts were 
analysed using the programs HBPLUS* and LIGPLOT™. All crystal structure super- 
positions of backbone « carbon traces were performed using the CCP4 program 
LSQKAB™”. The solvent-accessible area was calculated using the PISA Server’. All 
molecular graphics representations were created using PyMol”. 
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Temperature-scan cryocrystallography reveals 
reaction intermediates in bacteriophytochrome 


Xiaojing Yang’, Zhong Ren”, Jane Kuk! & Keith Moffat!'? 


Light is a fundamental signal that regulates important physio- 
logical processes such as development and circadian rhythm in 
living organisms. Phytochromes form a major family of photore- 
ceptors responsible for red light perception in plants, fungi and 
bacteria’. They undergo reversible photoconversion between red- 
absorbing (Pr) and far-red-absorbing (Pfr) states, thereby ultimately 
converting a light signal into a distinct biological signal that 
mediates subsequent cellular responses”. Several structures of micro- 
bial phytochromes have been determined in their dark-adapted Pr or 
Pfr states*’. However, the structural nature of initial photochemical 
events has not been characterized by crystallography. Here we report 
the crystal structures of three intermediates in the photoreaction of 
Pseudomonas aeruginosa bacteriophytochrome (PaBphP). We used 
cryotrapping crystallography to capture intermediates, and followed 
structural changes by scanning the temperature at which the photo- 
reaction proceeded. Light-induced conformational changes in 
PaBphP originate in ring D of the biliverdin (BV) chromophore, 
and E-to-Z isomerization about the Cj;=C,. double bond between 
rings C and D is the initial photochemical event. As the chromo- 
phore relaxes, the twist of the C,; methine bridge about its two 
dihedral angles is reversed. Structural changes extend further to 
rings B and A, and to the surrounding protein regions. These data 
indicate that absorption of a photon by the Pfr state of PaBphP 
converts a light signal into a structural signal via twisting and 
untwisting of the methine bridges in the linear tetrapyrrole within 
the confined protein cavity. 

Cryotrapping and time-resolved room-temperature experiments 
are two main experimental strategies to study the structures of intrins- 
ically short-lived reaction intermediates*. To establish the molecular 
mechanism of Pfr/Pr photoconversion, we generated and cryotrapped 
intermediates between the reactant Pfr state and product state(s) in 
fully photoactive crystals of the photosensory core module (PCM)? of 
P. aeruginosa BphP. We followed the progress of the reaction by 
applying a ‘trap-pump-trap-probe’ strategy at variable pump tem- 
peratures (Fig. 1 and Supplementary Fig. 1; Methods Summary). Here, 
temperature mimics time: the higher the pump temperature, the 
greater the structural relaxation and the further a reaction proceeds 
along its pathway. 

We collected diffraction data from six crystals at ten pump tem- 
peratures between 100 and 180K (Supplementary Table 1), and cal- 
culated difference (Fiignt — Faark) electron density maps from 14 light 
data sets and 6 reference dark data sets (Supplementary Fig. 1a). In all 
maps, strong and highly significant difference electron densities are 
exclusively concentrated at the chromophore-binding sites embedded 
in the GAF domain (Fig. 1b). Singular value decomposition (SVD) 
analysis’ of difference densities within a 5-A radius of the aligned 
chromophores revealed three significant singular values, indicating 
three major, independent, light-induced structures (Supplemen- 
tary Fig. Ic, d). Because difference densities are largely consistent 
among the eight monomers in the asymmetric unit and between dif- 
ferent crystals illuminated at the same temperature, we averaged these 


densities by applying non-crystallographic symmetry (NCS), and 
focus here on the principal features common to the eight monomers. 

Difference densities vary markedly with pump temperature (Fig. 2). 
At the lowest temperatures they appear near ring D of the BV chromo- 
phore, which suggests that light-induced structural changes originate in 
ring D. As the temperature rises, difference densities expand to ring 
C and eventually to ring B and ring A, thus exhibiting a systematic 
structural progression as a function of temperature. On the basis of 
representative difference maps at 110, 130 and 173K, we modelled 
three light-induced chromophore structures, denoted L1, L2 and L3, 
respectively (Fig. 3a, b). We further refined the initial models of the L1, 
12 and L3 structures jointly in real space against the NCS-averaged, 
SVD-filtered difference maps at all pump temperatures, and deter- 
mined their relative concentrations at each temperature (Fig. 3g). 

In the L1 structure represented at 110 K, strong positive and nega- 
tive densities are roughly aligned in the plane of ring D of the bilin 
chromophore (Fig. 3a). Negative densities span the pyrrole nitrogen of 
ring D and the side chain of a highly conserved Asp 194 from the 
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Figure 1 | Trap-pump-trap-probe experiment. a, Ribbon diagram of the 
PaBphP-PCM dimer. The BV chromophore is coloured in cyan. 

b, Experimental difference (Fight — Faark) map at 130 K (contoured at +50, 
where o is the standard deviation of difference densities across the entire map). 
Strong positive (green) and negative (red) densities with peak signal greater 
than +120 are clustered near the chromophores of the eight monomers (A-H) 
in the asymmetric unit. c, Dark stripes of a mounted crystal correspond to 
segments from which X-ray data sets were collected. d, A ball-and-stick 
representation of the chromophore in the Pfr state (PDB accession 3NHQ). The 
a-face of a pyrrole ring is defined when atom numbering follows a clockwise 
direction, with B defined as the opposite face. For definition of the o-face of an 
entire bilin chromophore see ref. 30. 
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Figure 2 | Difference maps (Fight — Faark) at pump temperatures between 

100 and 180 K. The BV chromophore (cyan) is shown in the Pfr state. All maps 
are contoured at +4o. Positive (green) and negative (red) densities represent 
structural changes associated with formation of photoproduct state(s) and loss 
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of the parent Pfr state, respectively. No significant difference densities with 
signal greater than +4o are detected at longer range, beyond the 5-A radius 
around the chromophore. 


PXSDIP sequence motif, indicating the rupture of a key hydrogen 
bond that stabilizes ring D in the Pfr state’. The corresponding positive 
densities between His 277 and ring D identify formation of the L1 
structure in which the chromophore has isomerized about the 
Cis=Cy6 double bond to adopt the 15Za configuration, and ring D 
is significantly shifted in its plane towards His 277. As a result, L1 
exhibits a smaller and strained C,4-C,;-—C). bond angle in the methine 
bridge between rings C and D compared to the ‘stretched’ bond angle 
in the Pfr state (Supplementary Fig. 3) and to those of the Pr state in 
Deinococcus radiodurans BphP (DrBphP) and Rhodopseudomonas 
palustris BphP3 (RpBphP3) (refs 4, 10). 

In the L2 structure represented at 130 K, strong difference densities 
near ring C suggest that ring C moves towards its «-face, and that its 
propionate side chain breaks the hydrogen bonds with His 277, 
Tyr 163 and Ser 275 present in the Pfr and L1 structures to form a 
new hydrogen bond with Ser 261 (Figs 3a and 4). Difference densities 
associated with rings C and D (Fig. 3c) suggest that counter-twist 
occurs across the C,; methine bridge in forming L2, in which ring D 
assumes a B-facial disposition relative to ring C. This partially relaxes 
strain in the C,; methine bridge while the C,) methine bridge between 
rings B and C is concomitantly twisted in the opposite direction 
(Supplementary Fig. 4b). 

In the L3 structure represented at 173 K, structural changes extend 
to rings A and B (Fig. 3a). Difference densities sandwiching these rings 
(Fig. 3d) indicate that they move as a unit slightly towards the B-face of 
the chromophore while the C; methine bridge retains its direction of 
twist. Motion is accompanied by small twists in the C; and C;o methine 


Figure 3 | Light-induced structural changes. a, Representative difference 
(Fight ~ Faark) maps at 110, 130 and 173 K. Arrows indicate viewpoints of 
panels c and d. b, Stereo view of the superposition of the chromophore 
conformations in the Pfr (cyan), L1 (magenta), L2 (yellow) and L3 (blue) 
structures. c, Side view of difference map at 130 K (contoured at +30) shows a 
twist in the C,; methine bridge. d, Side view of difference map at 173K 
(contoured at +20) indicates a B-facial shift of rings B/A. Arrows in ¢ and 
d mark the o-face of the chromophore. e, Difference densities at 173 K 
(contoured at +20) near the PXSDIP motif (residues 191-196) at the GAF- 
PHY interface. f, Differences densities associated with the side chains of 

His 277, Tyr 163 and Tyr 190 (map contours: +36 at 173K). g, Relative 
concentrations of the L1, L2 and L3 structures as a function of pump 
temperature. 
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Figure 4 | Light-induced molecular events in PaBphP. Top, the 
chromophores are in ball-and-stick representation, with their surrounding 
residues shown as van der Waals spheres. Green dotted lines indicate potential 
interactions with each cryotrapped structure. Bottom, schematic 


bridges (Supplementary Fig. 4b). Rings B and C are more coplanar in 
L3 than in L2, consistent with a more relaxed C;, methine bridge in L3. 

These data demonstrate that E-to-Z isomerization about the 
Ci5=Cyj6 double bond between rings C and D is the initial structural 
event in the Pfr-to-Pr photoreaction of PaBphP. This is consistent with 
findings in many phytochromes and bacteriophytochromes*''”; but 
contrasts with a recent report based on room temperature NMR spec- 
troscopy, in which rotation of ring A occurs between the Pr and Pfr 
states of an unusual, knotless phytochrome consisting of only the GAF 
domain’’. In PaBphP, when ring D flips into the 15Za configuration 
within a protein cavity that is still optimized to accommodate the 15Ea 
configuration of the Pfr state, a highly distorted and strained C,; 
methine bridge is generated in L1. As L1 evolves to L2, counter-twist 
across the C;; methine bridge partially relieves the strain. Motion of ring 
C results in a Cyo methine bridge twisted in the opposite direction in L2, 
further relaxation of which leads to the L3 structure. These data indicate 
a reaction trajectory that proceeds in the order Pfr—L1—L2— L3 
(Fig. 4). 

To relate these cryotrapped L1, L2 and L3 structures to spectro- 
scopic intermediates in the Pfr-to-Pr photoreaction’*"*, we measured 
visible absorption spectra on a crushed PaBphP-PCM crystal at tem- 
peratures between 100 and 180K using the X-ray experimental pro- 
tocol (Supplementary Fig. 5). Difference absorption spectra between 
illuminated and reference ‘dark’ states show significant loss of the Pfr 
state, indicated by a negative peak at 768 nm, and formation of blue- 
shifted photoproducts (Supplementary Fig. 5b). SVD analysis revealed 
two significant basis difference absorption spectra, with blue-shifted 
peaks between 650 and 700 nm. The basis spectrum with a blue-shifted 
peak at 684 nm is largely populated between 100 and 150K, in agree- 
ment with the temperature dependence of the population of the L2 
structure in X-ray data (Supplementary Fig. 5c and Fig. 3g). Blue- 
shifted absorption peaks are consistent with the distorted tetrapyrrole 
conjugated system observed in L2 and L3. 

The LI, L2 and L3 structures probably arise from early molecular 
events in the Pfr-to-Pr reaction of PaBphP. One or more may corre- 
spond to the Lumi-F spectroscopic intermediate detected on the 
femto- to picosecond timescale by room-temperature time-resolved 
spectroscopy and characterized by blue-shifted absorption peaks’*'*””. 
These structures are consistent with strong difference bands from 
Fourier transform infrared (FTIR) spectroscopy that were attri- 
buted to the carbonyl group of ring D resulting from 15Ea-to-15Za 
isomerization and the B—C methine stretching in cryotrapped Lumi-F 
of Calothrix CphA”’. L1 may be identified with early intermediates in 
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representation of changes in relative disposition of the four pyrrole rings of the 
BV chromophore, in which pyrrole rings A, B, C and D (boxes) are linearly 
connected by methine bridges (circles). The «- and B-faces of the chromophore 
are denoted by arrows. 


Synechocystis Cph1, in which the C,;-H out-of-plane (HOOP) mode 
was detected by ultrafast Raman spectroscopy”. The properties of L1 
are also consistent with the NMR spectroscopic data on the cryo- 
trapped Lumi-F of Cph1, which indicated that the C,,-C,;-C;. angle 
is distorted following 15Ea-to-15Za isomerization’. Furthermore, our 
temperature-scanning range (100-180 K) coincides with the temper- 
ature range in which Lumi-F intermediates were trapped in several 
phytochrome systems'’”*°. Formation of late Meta-F intermediates 
would require higher pump temperatures” to permit more extensive 
structural relaxation in the protein. However, at temperatures > 180 K 
the cryoprotectant solution undergoes a glass transition which causes 
severe deterioration in crystal diffraction®®. 

In both the L2 and L3 structures, the twist of the dihedral angles in the 
C,5 methine bridge is quite distinct from those of the chromophore 
structures in the Pr and Pfr states (Figs 3 and 4 and Supplementary 
Fig. 4b). Both the Pr and Pfr crystal structures of BphPs**"° exhibit an 
a-facial disposition of ring D relative to ring C, which corresponds to a 
negative rotation of the red absorbance band in the circular dichroism 
spectra of BV-containing bacteriophytochromes such as PaBphP, 
DrBphP and Agrobacterium tumefaciens Agpl (refs 21, 22). 
Evidence for such counter-twist events in the methine bridges during 
photoconversion has been presented in Cph1 and Agp1 (refs 19, 22). 
We propose that the BV chromophore of PaBphP generates structural 
signals in response to light via subtle and local twist or counter-twist 
motions in the methine bridges within the confined protein cavity 
(Fig. 4), which alter specific interactions between the pyrrole rings 
and their immediate protein surroundings. It remains to be seen to 
what extent our findings in PaBphP apply to other members of the 
diverse and expanding phytochrome superfamily. 

The protein moiety also has an important role. First, following 
prompt E-to-Z isomerization, steric clashes between ring D and the 
side chain of Asp 194 lead to initial strain in L1 that drives subsequent 
relaxation events. Second, the side chains of His277, Tyr 163 and 
Tyr 190 surrounding ring D also move slightly but concertedly to 
accommodate ring D as the chromophore evolves from Pfr to the L3 
structure (Fig. 3f). Third, Ser261 seems to stabilize L2 and L3 via 
hydrogen bonds to the propionate group of ring C (Fig. 3a, 4). The 
single point mutant $261A inhibits formation of the Pr state upon 
illumination and significantly accelerates dark reversion” (Sup- 
plementary Fig. 6b). This provides further evidence that L2 and L3 
are authentic photoproducts on the productive trajectory towards the 
product Pr state. Fourth, more extensive structural changes occur in L3 
near the highly conserved PXSDIP sequence motif at the interface of 
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the GAF and PHY domains. This segment and the side chain of 
Tyr 190 are pushed away from the chromophore as rings B/A and ring 
D move towards the B-face of the chromophore (Fig. 3e). These move- 
ments expand the chromophore cavity to allow further relaxation, and 
may trigger further structural perturbations between the GAF domain 
and the arm of the PHY domain. As the reaction proceeds further, 
Tyr 163 and Tyr 190 may adjust their side-chain rotamers to reshape 
the ring D pocket to accommodate the product Pr state”’. 

To examine the light dependence of histidine kinase (HK) activity in 
PaBphP, we conducted HK assays on wild type and the $261A mutant 
of full-length PaBphP under light and dark conditions (Supplemen- 
tary Fig. 6) and found that autophosphorylation of PaBphP is light 
dependent, in contrast to earlier observations”. As in R. palustris 
BphP2 (ref. 25) and Cph1 (ref. 26), PaBphP shows higher levels of 
autophosphorylation in the Pr state (here, the light state) than in the 
Pfr state. S261A exhibits a photoconversion phenotype that greatly 
prefers the Pfr state, and shows significantly reduced HK activity 
compared to wild type. We predict that small light-induced structural 
changes, exemplified by the cryotrapped intermediate structures LI, 
L2 and L3, propagate from the chromophore-binding pocket of the 
amino-terminal photosensory domains to the carboxy-terminal HK 
domain via further tertiary and/or quaternary rearrangements”, 
which ultimately affect the HK activity and convert a light signal into 
a biological signal. 


METHODS SUMMARY 


Structural heterogeneity is intrinsic to all dynamic processes, and often 
challenges accurate interpretation of both cryotrapping experiments at 
a single temperature and time-resolved crystallographic data”*”’. This 
work presents a temperature-scan and analytical strategy to resolve 
structural heterogeneity and determine distinct, homogeneous struc- 
tural species. 

We applied a ‘trap—pump-trap-probe’ strategy to photoactive crystals 
of PaBphP-PCM. We first trapped the dark-adapted Pfr state (trap dark) 
by freezing the crystal (grown at 293 K in the dark) in liquid nitrogen 
under safety green/blue light, and collected a reference crystallographic 
‘dark’ data set at 100K for each crystal. The same crystal was then 
uniformly illuminated under white light for 10-15 min at elevated tem- 
peratures to generate reaction intermediates (pump), then cryo-cooled 
to 100 K (trap light) to collect a ‘light’ data set from a fresh crystal volume 
(probe) (Fig. 1c). We calculated 14 (Fight — Faarx) difference maps from 
six crystals that cover 10 pump temperatures between 100 and 180 K. As 
each map contains eight monomers in the asymmetric unit, we obtained 
112 independent difference maps for a PaBphP monomer. We spatially 
aligned these maps on a reference monomer C, and subjected difference 
densities within a 5-A radius of the aligned chromophores to singular 
value decomposition (SVD)’ that identified three significant singular 
values and yielded 112 noise-reduced difference maps (Supplementary 
Fig. 1c, d). Atomic models for the L1, L2 and L3 structures were initially 
built based on representative difference densities at 110, 130 and 173 K, 
respectively. These models were further refined jointly in real space 
(together with a fixed reference Pfr conformation) against SVD-filtered 
difference maps at all pump temperatures. On the basis of least squares 
fitting between the calculated and observed difference densities, we also 
obtained relative concentrations of L1, L2 and L3 at each temperature. 
Supplementary Fig. la summarizes methods and software used in X-ray 
data reduction and analysis. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Purification, mutagenesis and crystallization. PaBphP-PCM containing residues 
1-497 of P. aeruginosa PA4117 was expressed, purified, crystallized and cryo- 
protected in the dark as described”. Site-directed mutagenesis was carried out using 
the QuikChange Site-directed Mutagenesis Kit (Stratagene). Both wild type and 
the 261A mutant of the full-length PaBphP were purified using the same protocol 
as for PaBphP-PCM. 

Data collection and analysis. Diffraction data were collected at the LSCAT 
21-IDG beamline, BioCARS 14-BMC and 14-IDB beamlines of the Advanced 
Photon Source. Temperature was controlled by a cryostream cooler (Oxford 
Cryosystems). Illumination was applied to a mounted crystal using two unfiltered 
fibre optic lights from different directions for 10-15 min, while the crystal rotated 
around the spindle axis of the goniometer at 30°s '. All diffraction images were 
processed using HKL2000*". Light and dark data sets from the same crystal were 
scaled using ScalelIt in CCP4 (ref. 32; Supplementary Table 1). A reference ‘dark’ 
structure in the Pfr state was refined at 2.55-A resolution using Phenix™ 
(Supplementary Table 2), and was used to calculate phases in generating difference 
Fourier maps by FFT in CCP4. Difference maps were masked and aligned using 
NCSMASK and MAPROT of CCP4, and were further subjected to SVD” analysis 
in DynamiX (Supplementary Fig. 1). Real space refinement of the L1, L2 and L3 
structures, including the chromophore and several adjacent residues (Gln 188- 
Asp 194, His 277, Ser 261, Tyr 163 and the covalent chromophore anchor Cys 12), 
was carried out against NCS-averaged, SVD-filtered difference maps using 


Dynamix (Supplementary Table 3). All modelling building was carried out using 
Coot™*. Structure figures were generated using PyMol (http://pymol.org) 
Ultraviolet-visible spectroscopy. Absorption spectra of proteins in solution were 
recorded at room temperature (25 + 2 °C) with a Shimadzu UV-1650 PC spec- 
trophotometer. Visible spectra of crystals were measured using a microspectro- 
photometer Xspectra (4DX-ray Systems) at BioCARS. 

HK autophosphorylation assay. The HK assay was modified based on published 
protocols*®*°, Reactions under light (continuous illumination with unfiltered fibre 
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ATM controls meiotic double-strand-break formation 


Julian Lange’, Jing Pan'+, Francesca Cole’, Michael P. Thelen®, Maria Jasin? & Scott Keeney'* 


In many organisms, developmentally programmed double-strand 
breaks (DSBs) formed by the SPO11 transesterase initiate meiotic 
recombination, which promotes pairing and segregation of homo- 
logous chromosomes’. Because every chromosome must receive a 
minimum number of DSBs, attention has focused on factors that 
support DSB formation’. However, improperly repaired DSBs can 
cause meiotic arrest or mutation®**; thus, having too many DSBs is 
probably as deleterious as having too few. Only a small fraction of 
SPO11 protein ever makes a DSB in yeast or mouse’ and SPO11 
and its accessory factors remain abundant long after most DSB 
formation ceases’, implying the existence of mechanisms that 
restrain SPO11 activity to limit DSB numbers. Here we report that 
the number of meiotic DSBs in mouse is controlled by ATM, a 
kinase activated by DNA damage to trigger checkpoint signalling 
and promote DSB repair. Levels of SPO11-oligonucleotide com- 
plexes, by-products of meiotic DSB formation, are elevated at least 
tenfold in spermatocytes lacking ATM. Moreover, Atm mutation 
renders SPO11-oligonucleotide levels sensitive to genetic mani- 
pulations that modulate SPO11 protein levels. We propose that 
ATM restrains SPO11 via a negative feedback loop in which kinase 
activation by DSBs suppresses further DSB formation. Our find- 
ings explain previously puzzling phenotypes of Atm-null mice 
and provide a molecular basis for the gonadal dysgenesis observed 
in ataxia telangiectasia, the human syndrome caused by ATM 
deficiency. 

SPO11 creates DSBs via a covalent protein-DNA intermediate that 
is endonucleolytically cleaved to release SPO11 attached to a short 
oligonucleotide, freeing DSB ends for further processing and recom- 
bination® (Fig. 1a). SPO11-oligonucleotide complexes are a quantitat- 
ive by-product of DSB formation that can be exploited to study DSB 
number and distribution®’ (Supplementary Fig. 1). We examined 
SPO11-oligonucleotide complexes by SPO11 immunoprecipitation 
and 3’-end labelling of whole-testis extracts from Atm mutant mice, 
which have multiple catastrophic meiotic defects, including chro- 
mosome synapsis failure and apoptosis’. The Atm” phenotype 
resembles that of mutants lacking DSB repair factors such as DMC1, 
indicating that absence of ATM causes meiotic recombination defects. 
Although Spoll”’ ~ mutation is epistatic to Atm (refs 11, 12), the 
functional relationship between ATM and SPO11 is complex, as 
meiotic defects of Atm” mice are substantially rescued by reducing 
Spoll gene dosage’*"* (discussed later). 

Unexpectedly, we found that adult Atm” testes exhibited an 
approximately tenfold elevation in steady-state levels of SPO11- 
oligonucleotide complexes relative to wild-type littermates (Fig. 1b) 
(11.3 + 4.5-fold, mean and standard deviation, n = 7 littermate pairs). 
This finding contrasts with Dmcl~“~ testes, which showed a ~50% 
reduction in SPO11-oligonucleotide complexes (0.51 + 0.06-fold rela- 
tive to wild type, n = 5) (Fig. 1c), as previously shown”’. The mutants 
share similar arrest points in prophase I, as determined by molecular 
and histological data!’; thus, increased SPO11-oligonucleotide com- 
plexes in Atm” spermatocytes are not an indirect consequence 


of arrest or of an increased fraction of meiocytes harbouring such 
complexes. 

In Atm testes, levels of free SPO11 (that is, not bound to an 
oligonucleotide) were much lower than in wild type (Fig. 1b). This is 
not because a large fraction of SPO11 has been consumed in covalent 
complexes with DNA—which alters its electrophoretic mobility—as 
free SPO11 was not restored to wild-type levels by nuclease treatment 
(Fig. 1d). Instead, because Spol1 transcript levels in wild type are 
highest in later stages of meiotic prophase’>’, after the arrest point 
of Atm” cells, reduced free SPO11 is attributable to the lack of later 
meiotic cell types, consistent with the reduced free SPO11 also found in 
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Figure 1 | SPO11 activity and expression in the absence of ATM. a, SPO11 
attacks the DNA phospohodiester backbone, forming a covalent intermediate 
with the 5’ strand termini of the DSB. Endonucleolytic cleavage removes 
SPO11 covalently attached to an oligonucleotide. b, c, Steady-state levels of 
SPO11-oligonucleotide (SPO11-oligo) complexes are elevated in Atm“ testes 
(b), but are decreased in Dmc1~ testes (c). Anti-mSPO11, anti-mouse SPO11 
antibody. SPO11 immunoprecipitates from extracts of whole adult testes were 
treated with terminal transferase and [o-*?P] dCTP, resolved by SDS-PAGE, 
and transferred to a membrane. Representative experiments using littermates 
of the indicated genotypes are shown. Top, autoradiograph. Bottom, anti- 
SPO11 western blot detection. Vertical lines, extent of SPO11-specific signals; 
and 8, major SPO11 isoforms; asterisk, non-specific terminal transferase 
labelling; arrowheads, migration position of immunoglobulin heavy chain. 

d, Treatment of labelled SPO11 immunoprecipitates with benzonase does not 
detectably alter levels of free SPO11, but this sequence non-specific nuclease 
efficiently removes the 3’-end label (compare lanes + benzonase), and was 
previously shown to completely remove DNA covalently bound to yeast Spol1 
(ref. 1). 
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Dmcl~ cells (Fig. 1c). As expected, the residual SPO11 protein in 
Atm”, like Dmcl~, testes was mostly SPO11f (Fig. 1b, c). SPO11a 
and SPO11 are major protein isoforms encoded by developmentally 
regulated splice variants; SPO11f is expressed earlier and is sufficient 
for nearly normal DSB levels*'*"”~°. 

Elevated SPO11-oligonucleotide complexes can be explained by an 
increased number of meiotic DSBs and/or a longer lifespan of com- 
plexes. To distinguish between these possibilities, we examined the 
initial appearance and persistence of SPO11-oligonucleotide com- 
plexes in juvenile mice, in which the first suite of spermatogenic cells 
proceeds through meiosis in a semi-synchronous fashion’. First, we 
assayed SPO11-oligonucleotide complexes in whole-testis extracts 
from wild-type C57BL/6] mice at postnatal days (d)7 to 24 (Fig. 2a). 
SPO11-oligonucleotide complexes first appeared between d9 and d10, 
when most cells of the initial cohort had entered leptonema. SPO11- 
oligonucleotide complexes persisted or increased slightly until d15, 
when the first cohort had progressed into pachynema. Levels rose still 
further from d16 to d18, coincident with the second cohort of sperma- 
togenic cells reaching leptonema*’. Thus, SPO11-oligonucleotide 
complexes appear at the same time as cell types that experience the 
majority of meiotic DSBs. Consistent with findings in mutants (see 
earlier), only trace amounts of free SPO11 protein were seen when 
SPO11-oligonucleotide complexes first appeared, with SPO11f the 
predominant isoform at these times (Fig. 2a). Importantly, SPO11- 
oligonucleotide complex levels did not decline between the first and 
second spermatogenic cohorts. We infer that the lifespan of the com- 
plexes is long relative to the duration of prophase, and that an 
increased lifespan is not a likely explanation for the large increase in 
steady-state SPO11-oligonucleotides in adult Atm testes. 

In support of this interpretation, we found that SPO11-oligonucleotide 
complexes were undetectable in Atm™~~ testes at d7 (data not shown) 
but were already elevated 3.3-fold compared with a wild-type littermate 
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Figure 2 | SPO11-oligonucleotide complexes from juvenile mice. 

a, SPO11-oligonucleotide complexes from testes of wild-type mice from d7- 
d24. Top, autoradiograph. Middle, quantification. Bottom, anti-SPO11 western 
detection. b, SPO11-oligonucleotides are elevated in testes from juvenile 
Atm mice. Top, autoradiograph. Bottom, anti-SPO11 western detection. 
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when they first appeared, increasing to 8.4-fold over wild type by 
dl2 (Fig. 2b). Because Atm’ juveniles showed higher SPO11- 
oligonucleotide levels as soon as the first leptotene cells appeared, we 
conclude that most, if not all, of the increase reflects a greater number 
of meiotic DSBs occurring during prophase I. 

Meiotic defects of mice lacking ATM are substantially suppressed by 
reducing Spo11 gene dosage: Spol1*’” Atm” spermatocytes pair and 
recombine their autosomes and progress through meiotic prophase to 
metaphase I, where they arrest due to a failure in sex chromosome 
pairing and recombination’**. The reason for this puzzling rescue was 
unknown, but our current findings suggest an explanation: the majority 
of meiotic defects in Atm-null spermatocytes are caused by grossly 
elevated DSB levels, which are lowered by Spo11 heterozygosity (which 
reduces SPO11 protein levels by half in adult and juvenile testes (ref. 17 
and our unpublished data)). Indeed, we found SPO11-oligonucleotide 
complexes in Spol1*/” Atm” mice to be substantially reduced com- 
pared with Atm littermates (Fig. 3a). The remaining increase in 
SPO11-oligonucleotide complexes in Spoll*’” Atm’ mutants com- 
pared with wild type (range of 4.5- to 7.8-fold, n = 2) is not simply a 
consequence of metaphase arrest, because SPO11-oligonucleotide 
complexes were not elevated in mice that exhibit a similar arrest point 
due to absence of MLH1, a protein involved late in recombination” 
(Fig. 3a). The fact that DSBs are still elevated in Spol1 Atm 
spermatocytes relative to wild type may account for some or all of 
the remaining defects in this mutant, including axis interruptions at 
sites of ongoing recombination and persistent unrepaired DSBs late in 
prophase I (ref. 14). 

Our findings indicate that the absence of ATM renders the extent of 
DSB formation sensitive to SPO11 expression levels. Therefore, we 
reasoned that increasing SPO11 expression should further elevate 
DSB formation in ATM-deficient cells. To test this prediction, we used 
a previously described transgene (Xmr-Spol1f,) that expresses the 
SPO11f isoform". Indeed, there was substantial further elevation of 
SPO11-oligonucleotide complex levels (20.9 + 1.5-fold over wild-type 
littermates, n = 3) upon introduction of this transgene in an Atm-null 
background with intact endogenous Spol1 (Fig. 3b). By contrast, the 
transgene resulted in only a modest increase in SPO1 1-oligonucleotide 
complexes in an ATM-proficient background (1.1 + 0.05-fold, n = 3) 
(Fig. 3b). 

SPO11-oligonucleotide complexes from Atm-null testes were con- 
sistently shifted to a higher electrophoretic mobility compared to wild 
type or other mutants (Figs 1, 2b and 3). To examine the distribution of 
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Figure 3 | Spo11 gene dosage modulates SPO11-oligonucleotide complex 
levels in Atm-deficient spermatocytes. a, SPO11-oligonucleotide complexes 
are reduced in Spol1 *” Atm“ testes relative to Atm’, but are more abundant 
than in wild type or in an M/hI~~ mutant, which similarly arrests at metaphase. 
Consistent with further meiotic progression than Atm”, both SPO11 isoforms 
(% and ) are expressed in Spol1'’” Atm~ testes, although at reduced levels 
due to Spo11 heterozygosity. Dagger, lower-mobility polypeptides probably 
originating from the Spo11 knockout allele’’. b, SPO11-oligonucleotide 
complexes are further elevated by SPO11f expression from the Xmr-Spol1P, 
transgene (Tg Spol1f)'* in Atm” spermatocytes. Introducing this transgene 
into an otherwise wild-type background only modestly increased SPO11- 
oligonucleotide complex levels. 
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oligonucleotide lengths, labelled complexes were protease-digested 
and the resulting oligonucleotides were electrophoresed on a high- 
resolution gel (Fig. 4a). As previously shown’, SPO11-oligonucleotides 
from wild type have a bimodal length distribution with prominent 
subpopulations at apparent sizes of ~15-27 and ~31-35 nucleotides. 
Atm’ mice showed a different pattern with or without the Spol1 
transgene: oligonucleotides in the shorter size range were less abund- 
ant relative to the ~31-35 nucleotide class and longer oligonucleotides 
appeared, including an abundant class of ~40-70 nucleotides and a 
subpopulation that ranged to >300 nucleotides. Spo11 *” Atm“ mice 
showed an intermediate pattern, with more pronounced enrichment of 
the ~31-35 nucleotide class relative to both smaller and longer oligo- 
nucleotides. These results indicate that ATM influences an early step in 
nucleolytic processing of meiotic DSBs, as has been proposed in yeast”’. 
In principle, altered oligonucleotide sizes could reflect changes in pre- 
ferred positions of the endonucleolytic cleavage that releases the 
SPO11-oligonucleotide complex, effects on 3’—>5' exonucleolytic 
digestion of SPO11-oligonucleotides after they are formed, or occur- 
rence of SPO11-induced DSBs at adjacent positions on the same DNA 
duplex (M. Neale, personal communication). Resection defects and 
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Figure 4 | Roles of ATM in DSB formation and processing. a, SPO11- 
oligonucleotide length distribution is altered in Atm’ spermatocytes. End- 
labelled SPO11-oligonucleotide complexes were treated with protease to digest 
the bound protein before electrophoresis on denaturing PAGE. Left, 
autoradiograph. Right, background-subtracted lane traces normalized to total 
signal within each lane. Asterisk, autoradiograph background. Each lane 
contains SPO11-oligonucleotides from the equivalent of different numbers of 
mice in order to better compare sizes: Atm*’*, 15 mice; Atm”, 2 mice; 
Spoll *” Atm’, 4 mice; Atm” plus transgene, 2 mice; Spoll”’ ~, 2 mice; mock, 
15 wild-type mice. nt, nucleotides. b, Negative feedback loop by which ATM 
regulates meiotic DSB levels. DSBs generated by SPO11 activate the ATM 
kinase, inhibiting further DSB formation. ATM may also have roles in repair of 
DSBs by homologous recombination; for example, by promoting DSB end 
resection. 
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adjacent DSBs (which conventional cytology would be unable to 
resolve) are both possible explanations for why SPO11-oligonucleotide 
complexes in Atm” spermatocytes show a greater increase than 
RAD51 focus numbers"*. 

Our results reveal an essential but previously unsuspected function 
for ATM in controlling the number of SPO11-generated DSBs. We 
suggest that activation of ATM by DSBs triggers a negative feedback 
loop that leads to inhibition of further DSB formation (Fig. 4b) via 
phosphorylation of SPO11 or its accessory proteins, several of which 
are known to be phosphorylated in budding yeast (for example, ref. 24) 
and are conserved in mammals’. ATM is activated in the vicinity of 
DSBs, as judged by SPO11- and ATM-dependent appearance of 
yH2AX (phosphorylated histone variant H2AX) on chromosomes at 
leptonema’****. Thus, we envision that the negative feedback loop 
operates at least in part at a local level, perhaps discouraging additional 
DSBs from forming close to where a DSB has already formed. Such a 
mechanism could minimize instances where both sister chromatids 
are cut in the same region, and could also promote more even spacing 
of DSBs along chromosomes. These studies provide a new molecular 
framework for understanding the gonadal phenotypes of patients with 
ataxia telangiectasia”®, which is caused by ATM deficiency”’. 


METHODS SUMMARY 


Mouse mutant alleles and the Spol 1 transgene were previously described'®"*”*°. 
Experimental animals were compared with controls from the same litter. 
Experiments conformed to regulatory standards and were approved by the 
MSKCC Institutional Animal Care and Use Committee. For measurement of 
SPO11-oligonucleotide complexes, both testes from each mouse were used per 
experiment, that is, littermate comparisons were made on a per-testis basis 
(Supplementary Fig. 1). Testis extract preparation, immunoprecipitation and 
western blot analysis were performed essentially as described’. Radiolabelled spe- 
cies were quantified with Fuji phosphor screens and ImageGuage software. The 
anti-mouse SPO11 monoclonal antibody was produced from hybridoma cell line 
180 (M.P.T., unpublished data). The size distribution of SPO11-oligonucleotides 
was determined essentially as described’ after radiolabelling with [-**P] cordycepin. 
Benzonase treatment of SPO11-oligonucleotide complexes followed manufacturer's 
instructions (Novagen). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Mouse mutant alleles and the Spo11/ transgene were previously describe 
Experiments conformed to regulatory standards and were approved by the 
MSKCC Institutional Animal Care and Use Committee. For measurement of 
SPO11-oligonucleotide complexes, both testes from each mouse were used per 
experiment, that is, littermate comparisons were made on a per-testis basis 
(Supplementary Fig. 1). The anti-mouse SPO11 monoclonal antibody was pro- 
duced from hybridoma cell line 180 (M.P.T., unpublished data). 

Testis extract preparation, immunoprecipitation and western blot analysis were 
performed essentially as described’. Testes were decapsulated, then lysed in 800 pl 
lysis buffer (1% Triton X-100, 400 mM NaCl, 25 mM HEPES-NaOH at pH 7.4, 
5mM EDTA). Lysates were centrifuged at 100,000 r.p.m. (355,040g) for 25 min in 
a TLA100.2 rotor. Supernatants were incubated with anti-mouse SPO11 antibody 
180 (541g per pair of testes) at 4°C for 1h, followed by addition of 30-40 pl 
protein-A-agarose beads (Roche) and incubation for another 3h. Beads were 
washed three times with IP buffer (1% Triton X-100, 150mM NaCl, 15mM 
Tris-HCl at pH 8.0). Immunoprecipitates were eluted with Laemmli sample buffer 
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and diluted six- to sevenfold in IP buffer. Eluates were incubated with additional 
anti-mouse SPO11 antibody 180 at 4°C for 1h, followed by addition of 30-40 ul 
protein-A~agarose beads and incubation at 4 °C overnight. Beads were washed 
three times with IP buffer and twice with buffer NEB4 (New England BioLabs). 
SPO11-oligonucleotide complexes were radiolabelled at 37°C for 1h using 
terminal deoxynucleotidyl transferase (Fermentas) and [o-**P] dCTP. Beads were 
washed three times with IP buffer, boiled in Laemmli sample buffer and fractionated 
on 8% SDS-PAGE. Complexes were transferred to a PVDF membrane by semi-dry 
transfer (Bio-Rad). Radiolabelled species were detected and quantified with Fuji 
phosphor screens and ImageGuage software. For western blot analysis, membranes 
were probed with anti-mouse SPO11 antibody 180 (1:2,000 in PBS containing 0.1% 
Tween 20 and 5% non-fat dry milk), then horseradish-peroxidase-conjugated 
protein A (Abcam; 1:10,000 in PBS containing 0.1% Tween 20 and 5% non-fat 
dry milk), and detected using the ECL+ reagent (GE Healthcare). The size distri- 
bution of SPO11 oligonucleotides was determined by radiolabeling with [«-**P] 
cordycepin then protease digestion followed by denaturing PAGE. Benzonase 
treatment of SPO11-oligonucleotide complexes was performed as per manufac- 
turer’s instructions (Novagen). 
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Verbal and non-verbal intelligence changes in the 


teenage brain 


Sue Ramsden!, Fiona M. Richardson!, Goulven Josse!, Michael S. C. Thomas’, Caroline Ellis, Clare Shakeshaft', 


Mohamed L. Seghier! & Cathy J. Price! 


Intelligence quotient (IQ) is a standardized measure of human 
intellectual capacity that takes into account a wide range of 
cognitive skills’. IQ is generally considered to be stable across the 
lifespan, with scores at one time point used to predict educational 
achievement and employment prospects in later years’. Neuro- 
imaging allows us to test whether unexpected longitudinal fluctua- 
tions in measured IQ are related to brain development. Here we 
show that verbal and non-verbal IQ can rise or fall in the teenage 
years, with these changes in performance validated by their close 
correlation with changes in local brain structure. A combination of 
structural and functional imaging showed that verbal IQ changed 
with grey matter in a region that was activated by speech, whereas 
non-verbal IQ changed with grey matter in a region that was 
activated by finger movements. By using longitudinal assessments 
of the same individuals, we obviated the many sources of variation 
in brain structure that confound cross-sectional studies. This 
allowed us to dissociate neural markers for the two types of IQ 
and to show that general verbal and non-verbal abilities are closely 
linked to the sensorimotor skills involved in learning. More 
generally, our results emphasize the possibility that an individual’s 
intellectual capacity relative to their peers can decrease or increase 
in the teenage years. This would be encouraging to those whose 
intellectual potential may improve, and would be a warning that 
early achievers may not maintain their potential. 

An individual's abilities and capacity to learn can be partly captured 
by the use of verbal and non-verbal (henceforth performance) 
intelligence tests. IQ provides a standardized method for measuring 
intellectual abilities and is widely used within education, employment 
and clinical practice. In the absence of neurological insult or de- 
generative conditions, IQ is usually expected to be stable across life- 
span, as evidenced by the fact that IQ measurements made at different 
points in an individual’s life tend to correlate well'*. Nevertheless, 
strong correlations over time disguise considerable individual vari- 
ation; for example, a correlation coefficient of 0.7 (which is not unusual 
with verbal IQ) still leaves over 50% of the variation unexplained. The 
study that we report here tested whether variation in a teenager’s IQ 
over time correlated with changes in brain structure. If it did, this 
would provide construct validity for the increase or decrease of IQ 
in the teenage years, because if IQ changes correspond to structural 
brain changes then they are unlikely to represent measurement error in 
the IQ tests. In addition, if verbal and performance skills change at 
different rates in different individuals, the neural markers for verbal 
and performance IQ changes could in principle be dissociated. This 
would overcome two of the challenges faced by previous studies of 
between-subject variability in IQ measures at a given time point: verbal 
and performance IQ are tightly correlated in individuals, so it has 
been hard to identify neural structures corresponding to each**; and 
there are many sources of between-subject variance in brain structure 
(for example gender, age, size and handedness) that hide the relevant 
differences. 


Our participants were 33 healthy and neurologically normal 
adolescents with a deliberately wide and heterogeneous mix of abilities 
(see Supplementary Information for details and the implications of our 
sampling for the generalizability of our conclusions). They were first 
tested in 2004 (‘time 1”) when they were 12-16 yr old (mean, 14.1 yr). 
Testing was repeated in 2007/2008 (‘time 2’) when the same indivi- 
duals were 15-20 yr old (mean, 17.7 yr). See Table 1 for further details 
of the participants. During the intervening years, there were no testing 
sessions, and participants and their parents had no knowledge that 
they would be invited back for further testing. On both test occasions, 
each participant had a structural brain scan using magnetic resonance 
imaging (MRI) and had their IQ measured using the Wechsler 
Intelligence Scale for Children (WISC-III) at time 1 and the Wechsler 
Adult Intelligence Scale (WAIS-III) at time 2 (see Supplementary 
Information for details). These two widely used, age-appropriate assess- 
ments” produce strongly correlated results at a given time point, con- 
sistent with them measuring highly similar constructs®. Scores on 
individual subtests are standardized against age-specific norms and 
then grouped to produce separate measures of verbal IQ (VIQ) and 
performance IQ (PIQ), with VIQ encompassing those tests most related 
to verbal skills and PIQ being more independent of verbal skills. 
Nevertheless, VIQ and PIQ scores are very significantly correlated with 
each other across participants: in our sample, the correlations between 
VIQ and PIQ were r= 0.51 at time 1 and r= 0.55 at time 2 (in both 
cases, n = 33; P<0.01). Full-scale IQ (FSIQ) is the composite of VIQ 
and PIQ and is regarded as the best measure of general intellectual 
capacity (the g factor) that has previously been shown to correlate with 
brain size and cortical thickness in a wide variety of frontal, parietal and 
temporal brain regions”*. 

The wide range of abilities in our sample was confirmed as follows: 
FSIQ ranged from 77 to 135 at time 1 and from 87 to 143 at time 2, with 
averages of 112 and 113 at times 1 and 2, respectively, and a tight 
correlation across testing points (r= 0.79; P< 0.001). Our interest 
was in the considerable variation observed between testing points at 
the individual level, which ranged from —20 to +23 for VIQ, —18 to 
+17 for PIQ and —18 to +21 for FSIQ. Even if the extreme values of 
the published 90% confidence intervals are used on both occasions, 
39% of the sample showed a clear change in VIQ, 21% in PIQ and 33% 
in FSIQ. In terms of the overall distribution, 21% of our sample showed 


Table 1 | Participants’ details 


Datum Age FSIQ VIQ PIQ 
Time 1 Mean (s.d.) 14.1(1.0) 112(13.9) 113(15.1) 108 (12.3) 
Min, max 12.6,16.5 77,135 84, 139 74, 137 
Time 2 Mean (s.d) 17.7(1.0) 113(14.0) 116(18.0) 107 (9.6) 
Min, max 15.9,20.2 87,143 90,150 83,124 
Correlation* r — 0.792+ 0.809} 0.589+ 
Change Mean (s.d.) 3.5(0.2) +1.0(9.0) +3.0 (10.6)—1.0 (10.2) 


(time 2—time1) Min,max 3.3,3.9 


* Correlation coefficient between scores at times 1 and 2. +P<0.01. 
n= 33 (19 male, 14 female). s.d., standard deviation. 


—18,+21 —20,+23 —18,+17 
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a shift of at least one population standard deviation (15) in the VIQ 
measure, and 18% in the PIQ measure. However, only one participant 
had a shift of this magnitude in both measures, and, for that particip- 
ant, one measure showed an increase and the other a decrease. This 
pattern is reflected in the absence of a significant correlation between 
the change in VIQ and the change in PIQ. The independence of 
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Figure 1 | Location of brain regions where grey matter changed with VIQ 
and PIQ. a, Correlation between change in grey matter density and change in 
VIQ (yellow) in the left motor speech region (peak in the left precentral gyrus at 
x 47 mm, y = —9mm, z= +30 mm, measured in Montreal Neurological 
Institute (MNI) space, with a Z-score of 5.2 and 681 voxels at P< 0.001). The 
corresponding effect on volume was slightly less significant (Z-score, 3.5; 118 
voxels at P< 0.001). b, Correlation between change in PIQ (red) and change in 
grey matter density in the anterior cerebellum (peak at x = +6 mm, 

y = —46 mm, z = +3 mm, in MNI space, with a Z-score of 3.9 and 210 voxels at 
P<0.001). Both effects were significant at P< 0.05 after familywise error 
correction for multiple comparisons in extent based on the number of voxels in 
a cluster that survived P< 0.001 uncorrected. In addition, the VIQ effect was 
significant at P< 0.05 after familywise error correction for multiple 
comparisons in height. The statistical threshold used in the figure (P < 0.001) 
illustrates the extent of the effects. Plots show the change in grey matter density 
versus the change in both VIQ and PIQ at the voxel with the highest Z-score in 
the appropriate region. Linear regression lines are shown for significant 
correlations. Changes in the motor speech region correlated with changes in 
VIQ but not changes in PIQ, whereas changes in the anterior cerebellum 
correlated with changes in PIQ but not changes in VIQ (P< 0.001). n = 33; 
GMD, grey matter density. 
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changes in these two measures allows us to investigate the effect of 
each without confounding influences from the other. 

To test whether the observed IQ changes were meaningfully 
reflected in brain structure, we correlated them with changes in local 
brain structure. This within-subject correlation obviates the many 
possible sources of between-subject variance and may have sensitized 
our analysis to neural markers of VIQ and PIQ that have not previ- 
ously been revealed. Given the distributed nature of brain regions 
associated with between-subject differences in FSIQ”’, regions of 
interest were not used in this analysis, and the results of the whole- 
brain analysis were only considered to be significant at P< 0.05 after 
familywise error correction in either height (peak signal at a single 
voxel) or extent (number of voxels that were significant at P< 0.001). 

Using regression analysis, we studied the brain changes associated 
with a change in VIQ, PIQ or FSIQ (see Methods Summary for details). 
The results (Fig. 1) showed that changes in VIQ were positively corre- 
lated with changes in grey matter density (and volume) in a region of 
the left motor cortex that is activated by the articulation of speech’. 
Conversely, changes in PIQ were positively correlated with grey matter 
density in the anterior cerebellum (lobule IV), which is associated with 
motor movements of the hand'*”’. Post hoc tests that correlated struc- 
tural change with change in each of the nine VIQ and PIQ subtest 
scores that were common in the WISC and WAIS assessments found 
that the neural marker for VIQ indexed constructs that were shared by 
all VIQ measures and that the neural marker for PIQ indexed con- 
structs that were common to three of the four PIQ measures (Table 2). 
This indicates that our VIQ and PIQ markers indexed skills that were 
not specific to individual subtests. There were no other grey or white 
matter effects that reached significance in a whole-brain structural 
analysis of VIQ, PIQ or FSIQ. See Supplementary Information for 
details of further post hoc tests. 

Our findings that VIQ changes were related to a motor speech 
region and PIQ changes were related to a motor hand region are 
consistent with previous claims that cognitive intelligence is partly 
dependent on sensorimotor skills'?'*. Using functional imaging in 
the same 33 participants performing a range of sensory, motor and 
language tasks, we confirmed that the left motor speech region iden- 
tified in the VIQ structural analysis was more activated by articulation 
tasks (naming, reading and saying “one, two, three”) than by semantic 
or perceptual tasks that required a finger press response (see 
Supplementary Information for details). In contrast, a region very 
close to the anterior cerebellum region identified in the PIQ structural 
analysis was more activated during tasks involving finger presses than 
during tasks involving articulation. Figure 2 shows these results at both 
the group level and the individual level. The locations of the grey 


Table 2 | Correlations between grey matter density and score 


Test type Motor speech Anterior 
region (r) cerebellum (1) 
Verbal tests Vocabulary 0.284* 0.142 
Similarities 0.438+ —0.021 
Arithmetic 0.477+ 0.304% 
Information 0.314t 0.185 
Comprehension 0.541+ 0.183 
Non-verbal tests Picture completion 0.038 0.363t 
Digit symbol coding 0.003 —0.028 
Block design 0.000 0.306t 
Picture arrangement 0.126 0.437+ 


* Trend (one-tailed) at P= 0.0545. +Significant (one-tailed) at P< 0.01. {Significant (one-tailed) at 
P<0.05. 

Correlations were calculated using changes in scaled (that is, age-adjusted) scores in the various 
subtests that were common to both the WISC and the WAIS. The change in grey matter density in the 
motor speech region correlated significantly with changes in scores in four of the five verbal subtests, 
and there was a near-significant trend in the fifth but it did not correlate significantly with changes in 
scores in any of the four tests that comprise PIQ. Conversely, the change in grey matter density in the 
anterior cerebellum correlated significantly with changes in scores in three of the four tests that 
comprise PIQ (the exception being the digit symbol coding test, which has a particular loading on 
processing speed) but correlated with changes in scores in only one of the verbal tests (the arithmetic 
test, which probably has the smallest verbal component of the verbal tasks). 
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Figure 2 | Functional activations in the regions identified by the structural 
analysis. The motor speech region was more activated by articulation tasks 
than by finger press tasks (x 48mm, y 10mm, z= +30mm (MNI); 
t = 14.7; P< 0.05 familywise-error-corrected for multiple comparisons across 
the whole brain), and corresponds to the region identified in the structural 
analysis for VIQ. These effects were consistently observed at the same 
coordinates for all individual subjects. In the three exemplar participants shown 
here (P1, P2, P3), the Z-scores were 3.9, 3.5 and 3.0, respectively. The anterior 
cerebellum region was more activated during finger presses than during 
articulation at both the group level (peak at x = +6 mm, y = —48 mm, 

z= —4mm (MNI); Z-score, 3.7; 216 voxels at P< 0.001 corrected for multiple 
comparisons in extent) and the individual level (P1: x = +12 mm, 

y = —48 mm, z= +2 mm (MNI); Z-score, 3.7; P2: x = +6mm, y = —50mm, 
z= —6mm (MNI); Z-score, 3.3; P3: x = +12mm, y 46mm, z= +2mm 
(MNI); Z-score, 4.9). In all cases, the activation peaks were identified from 
whole-brain analyses and the peak effects for the correlation with structure are 
illustrated with blue cross hairs in both the structural results and the functional 
results. This illustrates that the location of the structural effects is within the 
regions identified by the functional effects. 


matter changes associated with VIQ and PIQ changes do not corre- 
spond to the anterior frontal and parietal regions associated with 
general intelligence’ (g factor). It may therefore be the case that g 
remains relatively constant across ages, but changes in the ability to 
perform individual subtests depend on changes in sensorimotor skills. 
It is also notable that although completion of the subtests comprising 
verbal and performance measures must implicate a network of brain 
regions, only structural changes in regions associated with sensorimo- 
tor skills showed correlations with changes in VIQ and PIQ. 

The changes in brain structure that correlated with changes in IQ 
allow us to explain some of the variance in terms of brain development. 
Specifically, 66% of the variance in VIQ at time 2 was accounted for by 
VIQ at time 1, a further 20% was accounted for by the change in grey 
matter density in the left motor speech region, with the remaining 14% 
unaccounted for. Similarly, 35% of the variance in PIQ at time 2 was 
accounted for by PIQ at time 1, with 13% accounted for by the change 
in grey matter density in the anterior cerebellum, leaving 52% un- 
accounted for. Future studies may be able to account for more of the 
between-subject variability by using a similar methodology with larger 
samples or other methodologies that measure structural or functional 
connectivity*””. 
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Our findings demonstrate considerable effects of brain plasticity in 
our sample during the teenage years, over and above normal develop- 
ment. By obviating the many sources of between-subject variance and 
controlling for global changes in brain structure, our within-subject 
analysis has allowed us to dissociate brain regions where structure reflects 
individual differences in verbal or non-verbal performance, in a way that 
has proved difficult in previous studies using behavioural data from a 
single point in time. We have also shown that the changes observed over 
time in the IQ scores of teenagers cannot simply be measurement error, 
because they correlate with independently measured changes in brain 
structure in regions that are plausibly related to the verbal and non- 
verbal functions tested. Further studies are required to determine the 
generalizability of this finding; for example, the same degree of plasticity 
may be present throughout life or the adolescent years covered by this 
study may be special in this regard. In addition, future work could 
consider the causes of the identified changes both in intelligence and 
in brain structure and how they impact on educational performance and 
employment prospects. The implication of our present findings is that an 
individual’s strengths and weaknesses in skills relevant to education and 
employment are still emerging or changing in the teenage years. 


METHODS SUMMARY 


This study was approved by the Joint Ethics Committee of the Institute of 
Neurology and the National Hospital for Neurology and Neurosurgery, 
London, UK. All structural and functional scans at times 1 and 2 were acquired 
from the same Siemens 1.5T Sonata MRI scanner (Siemens Medical Systems). The 
structural images were acquired using a T1-weighted modified driven equilibrium 
Fourier transform sequence with 176 sagittal partitions and an image matrix of 
256 X 224, yielding a final resolution of 1 mm? (repetition time, 12.24 ms; echo 
time, 3.56 ms; inversion time, 530 ms). To pre-process the 66 structural images (33 
participants X 2 time points), we used SPMB8 (http://www.fil.ion.ucl.ac.uk/spm) 
with the DARTEL toolbox to segment and spatially normalize the brains into the 
same template, with and without modulation. Modulated images incorporate a 
measure of local brain volume, whereas unmodulated images, used with propor- 
tional scaling to correct for global grey matter, provide a measure of regional grey 
matter density. Previous studies*”** have shown that the correlations between 
brain structure and cognitive ability are better detected by grey matter density. 
Coordinates for each voxel were converted to standard MNI space. Images were 
smoothed using a Gaussian kernel with an isotropic full-width of 8 mm at half- 
maximum. The relationship between change in IQ and change in brain structure 
was investigated by entering the appropriate pre-processed images (modulated or 
unmodulated grey or white matter) into within-subject paired t-tests, with change 
in IQ (VIQ, PIQ or FSIQ) and year of scan as covariates. The degree to which IQ at 
time 2 was predicted by changes in brain structure was investigated in a hierarch- 
ical regression analysis with IQ at time 1 entered before change in brain structure. 
Details of the functional imaging method have been reported elsewhere**** and 
are summarized in Supplementary Information. 
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Bidirectional resection of DNA double-strand breaks 


by Mrell and Exol 


Valerie Garcia!, Sarah E. L. Phelps', Stephen Gray’ & Matthew J. Neale! 


Repair of DNA double-strand breaks (DSBs) by homologous recom- 
bination requires resection of 5’-termini to generate 3’ -single-strand 
DNA tails’. Key components of this reaction are exonuclease 1 and the 
bifunctional endo/exonuclease, Mrel1 (refs 2-4). Mrell endonu- 
clease activity is critical when DSB termini are blocked by bound 
protein—such as by the DNA end-joining complex’, topoisomerases® 
or the meiotic transesterase Spo11 (refs 7-13)—but a specific function 
for the Mre11 3’-5’ exonuclease activity has remained elusive. Here 
we use Saccharomyces cerevisiae to reveal a role for the Mre11 exo- 
nuclease during the resection of Spoll-linked 5’-DNA termini in 
vivo. We show that the residual resection observed in Exol-mutant 
cells is dependent on Mre11, and that both exonuclease activities are 
required for efficient DSB repair. Previous work has indicated that 
resection traverses unidirectionally’. Using a combination of physical 
assays for 5’-end processing, our results indicate an alternative mech- 
anism involving bidirectional resection. First, Mre11 nicks the strand 
to be resected up to 300 nucleotides from the 5’-terminus of the 
DSB—much further away than previously assumed. Second, this nick 
enables resection in a bidirectional manner, using Exol in the 5’-3’ 
direction away from the DSB, and Mrel11 in the 3’-5’ direction 
towards the DSB end. Mre11 exonuclease activity also confers resist- 
ance to DNA damage in cycling cells, suggesting that Mre11-catalysed 
resection may be a general feature of various DNA repair pathways. 

We sought to clarify Mrell-dependent processing reactions at 
DSBs with blocked termini. Blocked DSB ends are created in meiosis 
by the topoisomerase-like transesterase Spoll to initiate crossover 
recombination” (Supplementary Fig. 1a). Aside from the importance 
of meiotic recombination to sexual reproduction and genetic diversity, 
we reasoned that the molecular reactions pertaining to repair of 
covalently blocked Spol1-DSBs would be informative to the repair 
of DNA lesions with other types of end blockage, such as failed 
topoisomerase reactions and DSB ends competitively bound by the 
non-homologous end-joining (NHEJ) machinery. 

Mre11-dependent endonucleolytic processing (nicking) of Spol1- 
DSBs generates two classes of Spoll-oligonucleotide fragments 
originating from the DSB end’* (Supplementary Fig. 1). Spoll- 
oligonucleotide complexes do not form in mre11 mutants completely 
abrogated for nuclease activity (mrel1-D56N and mrel1-H125N; 
Fig. 1a), corroborating earlier conclusions that Mre11 directly processes 
Spol1-DSBs’""’. Recently, a mutant of Mre11 that is exonuclease defi- 
cient, but mostly endonuclease proficient was described for the ortho- 
logous proteins in Schizosaccharomyces pombe and Pyrococcus 
furiosus'®. We generated the equivalent mutation in S. cerevisiae 
Mrel1 (His 59 to Ser; Supplementary Fig. 2a) and tested its function. 
During meiosis, Spoll-oligonucleotide complexes were readily 
detected in mre11-H59S cells (Fig. 1a), indicating that the allele is 
endonuclease proficient in vivo. Biochemical assays with recombinant 
Mrel11-H59S showed reduced, but not abolished, 3’-5’ exonuclease 
activity on linear duplex DNA, and retention of much of the single- 
stranded (ss)DNA endonuclease activity (Supplementary Fig. 2)— 
observations consistent with mrel1-H59S partially separating the 
two nuclease functions. 


We assessed DSB repair kinetics at two meiotic recombination hot- 
spots using Southern blotting and probes for the relevant genomic loci 
(HIS4::LEU2 and ARE]; Fig. 1b and Supplementary Fig. 3). In the 
mre11-H59S strain, DSBs formed at normal levels and repaired as 
crossovers with normal timing (Fig. 1b, c). ssDNA resection, which 
can be qualitatively assessed by the relative migration of the DSB band 
on native agarose gels, also seemed unaffected by the mrel1-H59S 
mutation (Fig. 1d, lanes 1 and 6, and Supplementary Fig. 4). 

The lack of an obvious defect in ssDNA resection proficiency sug- 
gested that any potential contribution to ssDNA generation by Mre11 
might be masked by the activity of another nuclease. During meiosis, 
the major resection pathway requires Exol (refs 17-21). We tested 
this idea by combining the mre11-H59S allele with an EXO1 deletion 
(exo1A), which we found itself to have slightly delayed DSB repair 
kinetics, with fewer DSBs repairing as crossovers (Fig. 1b, c). 
Migration of the DSB band on agarose gels revealed that ssDNA re- 
section was measurably reduced in exo] A—but not entirely abolished 
relative to a mre11-H125N control (where the failure to remove Spol11 
prevents all resection; Fig. 1d, lanes 3, 4 and 5, and Supplementary 
Fig. 4). Our data agree with those of others investigating a meiotic role 
for Exol (refs 17-21). 

The combination of mre11-H59S with exo1A caused DSBs to tran- 
siently accumulate for a longer period, with DSBs detectable for 2h 
longer than in matched controls (Fig. 1b), and with formation of 
crossover recombinants reduced and delayed (Fig. 1c). These defects 
in DSB repair are correlated with a reduction in the mobility of DSB 
DNA on agarose gels (Fig. 1d and Supplementary Fig. 4). Specifically, 
in comparison to the mrel1-H59S or exo1A single mutants, we 
observed DSB signals to migrate similarly to DSBs from the mre11- 
H125N control (Fig. 1d, lanes 2 and 3). This reduction in mobility is 
indicative of less ssDNA resection in mre11-H59S/exo1A than of either 
single mutant. We conclude that Exol and Mre11 collaborate to enable 
efficient ssDNA generation at meiotic DSBs. 

Unrepaired DSBs activate Tell and Mecl (yeast orthologues of the 
mammalian checkpoint kinases ATM and ATR respectively) to induce 
phosphorylation of histone H2Ax on Ser 129 and hyper-phosphorylation 
of Hop] (a meiosis-specific adaptor of the DNA damage response”). 
Phosphorylated H2Ax and Hop] were detected in all strains, indicating 
that significant resection is not essential for checkpoint activation 
(Fig. le). However, phospho-H2Ax accumulated and persisted until 
late time points only in mre11-H59S/exo1A, and Hop] phosphorylation 
persisted for at least 2 h longer than in matched controls. These obser- 
vations are consistent with a genome-wide defect in DSB repair in 
mre11-H59S/exo1A. The Tell branch of the signalling pathway is 
primarily activated by unresected DSBs”. We observed that Hop] 
phosphorylation in the mrel1-H59S/exo1A background is highly 
dependent on Tell (Supplementary Fig. 5), consistent with less re- 
section occurring genome-wide. 

To investigate if DSB processing defects were affecting meiotic chro- 
mosome segregation, we determined the efficiency of progression 
through anaphase I and II (Fig. 1f). Meiotic progression of wild type 
and mre11-H59S was essentially identical, whereas exo1A was delayed 
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Figure 1 | Mre11- and Exol-dependent resection and repair of meiotic 
DSBs. a, Autoradiograph of Spol 1-oligonucleotide formation (arrowheads) in 
Mrel11 nuclease-defective cells: mre11-D56N, -H125N, -H59S; and control 
spol1-Y135F cells in which Spol1-DSBs do not form. WT, wild type. 
Immunoprecipitated Spol1-oligonucleotide complexes are 3’-end-labelled 
using terminal transferase and separated on SDS-PAGE" (see Supplementary 
Fig. 1). Asterisk marks an unrelated labelling artefact. b-f, Time course of 
events during meiosis for the indicated genotypes. b, c, Quantitative analysis of 
DSB (b) and crossover (c) signals at HIS4::LEU2. DNA at each time point was 
digested with PstI (b) or Xhol (c), separated by agarose gel electrophoresis and 
blots hybridized with probes to LSB5 (b) or STE5O (c). DSB signals were 
quantified as a percentage of specific lane signal (see also Supplementary Fig. 3). 


by about 1h, with slightly reduced overall efficiency. In contrast, 
nuclear division in the mre11-H59S/exo1A double mutant was poor, 
with more than half of the cells having failed to complete even the first 
meiotic nuclear division after 10h in meiosis. Analysis of sporulation 
efficiency after 24h revealed increased incidence of aberrant tetrad 
maturation and/or nuclear packaging where orphaned chromo- 
some fragments were observed outside the maturing spore wall 
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d, Relative extent of DSB resection at HIS4::LEU2 (at 4h in meiosis). DNA was 
digested with BglII and blots hybridized with a STE50 probe (see also 
Supplementary Fig. 4). e, Western blot analysis of phosphorylated H2Ax and 
Hop! (arrowhead) detected from TCA-precipitated whole-cell lysates. 
Phosphorylated Hop] is indicated by dot. Asterisk marks cross-reacting band. 
f, Progression through anaphase I and II assessed by microscopic examination 
of 4',6-diamidino-2-phenylindole (DAPI)-stained cells. g, Distribution of 
spore viabilities per tetrad (n, number of 4-spore tetrads dissected). The 
difference in distribution between mre11-H59S/exo1A and exo1A is highly 
significant ( a test for goodness-of-fit; y = 63.084, 4 degrees of freedom, 
P<0,0001). Absolute spore viabilities are: wild type, 97%; mre11-H59S, 90%; 
exolA, 76%, mre11-H59S/exo1A, 59%. 


(Supplementary Fig. 6). These defects in meiotic chromosome segrega- 
tion manifested as reduced spore viability in the double mutant com- 
pared to controls (Fig. 1g). 

To characterize in greater detail the molecular defect caused by 
mrel1-H59S, we looked carefully at the distribution of Spoll- 
oligonucleotide products generated in vivo by the 5’-DSB processing 
reaction (Fig. 2a, b). In wild-type cells, two major classes of Spol1l- 
oligonucleotide are observed, which differ by the length of attached 
DNA”. By contrast, in mre11-H59S we observed a shift in this dis- 
tribution towards higher molecular weight Spol1-oligonucleotide 
species (Fig. 2a and Supplementary Fig. 7). Importantly, total Spol1- 
oligonucleotide formation was not itself delayed (Supplementary 
Fig. 7), indicating that the Spoll-removal reaction initiates with 
normal timing in mre11-H59S cells, but is defective in forming shorter 
molecules. To clarify the precise size distribution of the Spoll- 
oligonucleotide molecules, we fractionated de-proteinized oligonu- 
cleotides using denaturing polyacrylamide gel electrophoresis 
(PAGE; Fig. 2b). In wild-type cells, two peak areas of signal 10-17 
nucleotides (nt) and 28-40 nt were apparent. These correspond to 
the shorter and longer Spoll-oligonucleotide classes detected on 
SDS-PAGE (Fig. 2a). We additionally detected signal (24% of total) 


Figure 2 | Mrel1-exonucleolytic processing of DSB ends. a, b, Spol1- 
oligonucleotide detection in wild type (WT) and mre11-H59S during meiosis. 
3'-end-labelled Spol1 complexes are fractionated by SDS-PAGE (a) or by 
nucleotide resolution urea/PAGE after proteolytic removal of Spol1 peptide 
(b). c, Mre11-H59S shows reduced 3’-5’ exonuclease activity on a nicked 
duplex. Reactions were performed as for Supplementary Fig. 2c. Stars indicate 
5' label, F3 3’ end has 5-nt extension and is refractory to Mrel1-mediated 
resection’’. d, Chromatin association of Spol1-oligonucleotide complexes. 
Wild-type cell extracts were fractionated and the abundance of Spol1- 
oligonucleotide complexes assessed in soluble versus chromatin-enriched 
material. e, f, Nuclease resistance of Spol1-oligonucleotide (Spo11-oligo) 
complexes. Chromatin-enriched material from d was treated with DNase I, and 
abundance of Spo1 1-oligonucleotide complexes compared to the simultaneous 
degradation of genomic DNA (e), or of a control 55-nt oligonucleotide (f). 
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in molecules 18-27 and 41-300nt long (Fig. 2b), indicating that 
the length distribution of processed molecules is significantly more 
heterogeneous than previously assumed, and that nicks are made at 
up to 300 nt from the DSB end. 

In the mre11-H59S mutant, the distribution was shifted such that 
the long oligonucleotide molecules of 41-300 nt made up a third of the 
material detected (Fig. 2b). Although it is possible that the altered 
distribution of oligonucleotide molecules is caused by a reduction in 
the endonuclease activity of Mre11-H59S, our physical and genetic 
observations in both wild type and mrel1-H59S can be readily 
explained if resection begins at relatively distant nicks (up to 300 nt 
from the DSB) and traverses bidirectionally both away from (using 
Exol) and towards (using Mrel1) the DSB end’. Such a model is 
consistent with the opposing polarities of the Mre1 1 and Exol exonuclease 
activities*°, and with the synergistic loss in resection we observe in 
mre11-H59S/exol1A. Moreover, the length distribution of Spoll- 
oligonucleotides is compatible with the extent of Exol-independent 
resection reported recently by others” and, as predicted, Spoll- 
oligonucleotide length is unchanged by loss of EXO1 (Supplemen- 
tary Fig. 8). Lastly, only a low background of Spol 1-oligonucleotide 
complexes are detected in endonuclease-defective mre11-D56N, ruling 
out the possibility that the long oligonucleotide molecules arise via an 
alternative nuclease nicking the 5’ strand (Fig. 2b). To test this mech- 
anism in vitro, we incubated Mrell protein with a nicked duplex 
substrate designed to mimic this proposed in vivo reaction, and found 
that Mrel1-H59S resected from the nick with lower efficiency than 
wild-type Mrel1 (Fig. 2c). 

Together, these observations led us to consider that the steady-state 
length of Spol1-oligonucleotide complexes might arise via the relative 
processivity of the 3’-5’ Mre11 exonuclease and the relative sensitivity 
to nucleolytic degradation of DNA close to the DSB end. Spol1-DSB 
formation requires at least ten factors’, suggesting that a large protein 
complex may reside at—and protect—the DSB end. If this model were 
correct, we expected Spol11l-oligonucleotide complexes to be asso- 
ciated with chromatin and resistant to nucleolytic degradation. We 
tested this idea by incubating a chromatin-enriched nuclear pellet 
from meiotic yeast cells with DNase I, then assessed the quantity of 
Spol1-oligonucleotide complexes remaining, relative to both bulk 
DNA and to an exogenous protein-free oligonucleotide included in 
parallel reactions (Fig. 2d-f). Greater than 90% of Spo1 1-oligonucleotides 
are found in the chromatin fraction and, remarkably, no loss in signal 
was observed despite extensive nucleolytic degradation of both the 
chromosomal DNA and the control oligonucleotide. We conclude that 
Spol1-oligonucleotide complexes are occluded from degradation even 
by exogenous nucleases—a prediction of our model. 

In cycling cells, Mrel1 nuclease activity promotes the onset of 
resection—a requisite for repair of DSBs by homologous recombina- 
tion’. To investigate a specific role for the Mrel1 3’-5' exonuclease 
during DNA repair in cycling cells, we challenged yeast cells with 
exposure to DNA damaging agents. Similar to complete abrogation 
of the endo/exonuclease activities (mrel1-H125N), reduced Mrel1 
exonuclease activity (mre11-H59S) sensitized cells to the DNA alkylat- 
ing agent methyl methanesulphonate (MMS) and to the topoisomerase 
poison camptothecin (CPT; Fig. 3). Compared to an MRE11 deletion, 


40 uM CPT 0.0250% MMS 
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Figure 3 | DNA damage sensitivity of exonuclease-defective Mre11 cells. 
Tenfold serial dilutions of the indicated strains were spotted onto solid media 
containing the indicated compounds and incubated at 30 °C for 2 days (CPT) 
or 3 days (MMS). 
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however, mrel11-H59S and mrel11-H125N are themselves far less 
sensitive, consistent with physical interactions within the Mrel1 com- 
plex being retained (Supplementary Fig. 9). In agreement with Mrel1 
endonuclease activity being unaffected in mre11-H59S, and allowing 
redundant processing pathways, combining mre11-H59S with a dele- 
tion of EXO1 did not further sensitize cells to MMS (Supplementary 
Fig. 10). Together these observations indicate that the exonuclease 
activity of Mrel11 is involved in the repair of various classes of DNA 
lesion. 

Understanding the regulation of DSB repair is a complex issue 
involving multiple factors with overlapping roles. Here, we propose 
a biological function for the 3'-5’ exonuclease activity of the evolu- 
tionarily conserved Mre11 protein. Previous work has indicated DSB 
resection to traverse unidirectionally’. We propose a refined model 
that involves the coordination of two resection activities of opposing 
polarity: Exol away from the DSB and Mrel1 towards the DSB end 
(Fig. 4). We favour the view that this exonuclease reaction begins at 
nicks created by the Mrel1 endonuclease (in conjunction with Sae2) 
and which are positioned at variable distance from the DSB end, 
perhaps due to locus-specific chromatin architecture”®. Although our 
assay detects only the site of incision closest to the DSB end, Mre11 
and Sae2 may create multiple nicks on the resecting strand'”?!”” 
(A. S. H. Goldman, personal communication) that, in combination 
with exonucleolytic processing, might further enhance resection effi- 
ciency. Lastly, the recent observation that the length and abundance 
of SPO11-oligonucleotide complexes is increased in Atm ‘~ mice”* 
(S. Keeney and M. Jasin, personal communication), suggests that 
Mrel1 exonuclease activity may be an evolutionarily conserved feature 
directly regulated by ATM. 

During meiosis, bidirectional processing may help to reinforce sub- 
sequent steps of repair, which at least in some cases seem to occur 
differentially on either side of the DSB**’?”°. Our observation that 
liberated Spol1-oligonucleotide complexes remain chromatin bound 
and relatively protected provides a clue to potential mechanisms of end 
differentiation. For example, retention of proteins on one or both of 
the DSB ends could influence subsequent steps of repair’ (Fig. 4b). 
Lastly, our observation that Mrel1-dependent incision occurs at some 
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Figure 4 | Model for bidirectional processing of DSBs by Mre11 and Exol. 
a, After DSB formation with blocked ends (hatched squares), Mre11/Sae2- 
dependent nicks flanking the DSB ends create initiation sites for bidirectional 
resection by Exol and/or Sgs1-Dna2 away from the DSB, and by Mre11 
towards the DSB end. Such terminal blocks could arise after base damage, 
trapping of a topoisomerase, or by avid binding of the NHEJ complex. 3’ ends 
are marked with triangles. Mrel1/Sae2 may make multiple nicks on the 5’ 
strand (light grey arrows), facilitating resection. b, In meiosis, the DSB ends are 
terminally blocked by covalently bound Spo11 protein (grey ellipses), and may 
be protected from Mrel1-dependent exonuclease degradation by a large 
metastable multisubunit complex (dashed outline), thereby generating the 
observed size distribution of Spol1-oligonucleotide complexes. 
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distance from the DSB end indicates that DSB formation and proces- 
sing reactions are coordinated over a considerable distance (300 bp of 
B-form DNA is ~100 nm). How these incision points are regulated— 
and restricted to only the 5’-ending strand—are fascinating questions 
for the future. 

Although much of this work concerns specifics of meiotic DSB pro- 
cessing, we suggest that a similar pathway may occur whenever the DSB 
end is blocked by a lesion or protein complex that prevents direct 
loading of the 5’-3’ exonuclease machinery. Such DNA blockages 
may be crosslinked protein, damaged DNA ends, or simply the stable 
binding of high affinity proteins to the DSB. As such, the nucleolytic 
incision pathway may provide the key point of regulation that controls 
the balance between NHEJ and homologous recombination. 


METHODS SUMMARY 


Yeast strains and culture methods. Meiotic cultures were prepared as 
described'*. Strains were derived from SK1 using standard techniques. Spol1 
protein is tagged by the HA3His6 epitope’’. Mrel1 mutations were introduced 
by pop-in/out using plasmids derived from pRS414-MRE11 (ref. 10). exo1A, tell1A 
and sae2A are full replacements of the open reading frame (ORF) with kanMX4, 
hphNT2 and kanMX6, respectively. A full strain list is available on request. For 
chronic DNA damage sensitivity, exponentially growing cultures were spotted in 
tenfold dilutions onto freshly prepared media. 

Molecular techniques. Spol1-oligonucleotide complexes were detected by two 
rounds of immunoprecipitation and end labelling as described’*. DSBs and cross- 
over recombinants were detected by Southern blotting genomic DNA after frac- 
tionation on agarose gels using standard techniques”. Radioactive signals were 
collected on phosphor screens, scanned with a Fuji FLA5100, and quantified using 
ImageGauge software. Phosphorylated H2Ax and Hop] protein were detected 
using Ab17353 (Abcam) and anti-Hop1l antisera (F. Klein), respectively. 
Subcellular fractionation of cell contents was performed by purifying hypotonically 
lysed spheroplasts through a sucrose cushion. Chromatin digests were performed at 
25 °C with DNase I (NEB). Reactions were split and processed for DNA extraction 
or Spoll-oligonucleotide purification. As a positive control, parallel reactions 
included a 5’-labelled 55-nt oligonucleotide. Recombinant GST-Mre11 proteins 
were purified and reacted with DNA substrates as described’. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Yeast strains. Strains of S. cerevisiae were derived from SK1 using standard 
techniques. Spol1 protein is tagged by the HA3His6 epitope’. Mrel1 mutations 
were introduced by pop-in/out using plasmids derived from pRS414-MRE11 (see 
below). exo1A, tel1A and sae2A are full replacements of the open reading frame 
(ORF) with kanMX4, hphNT2 and kanMX6, respectively. A full strain list is 
available on request. 

Meiotic cultures. YPD cultures (1% yeast extract, 2% peptone, 2% glucose) were 
diluted 100-fold into YPA (1% yeast extract, 2% peptone, 1% K-acetate) and grown 
vigorously for 14h at 30 °C. Cells were collected by centrifugation, washed once in 
water, resuspended in an equal volume of prewarmed 2% K-acetate containing 
diluted amino acid supplements, and shaken vigorously at 30 °C. 

Construction of Mre11 nuclease mutant strains. The mre11-H59S mutant was 
constructed by site-directed mutagenesis PCR on the integrating plasmid pSM444 
(pRS406::mre11-D56N), reversing the D56N mutation to wild type and intro- 
ducing H59S (forward primer: GTACAGTCCGGTGATCTTTTTAGCGTG 
AATAAGCC; reverse primer: GGCTTATTCACGCTAAAAAGATCACCGG 
ACTGTAC). Integration plasmids containing mre11-D56N and mrel1-H125N 
nuclease dead mutant alleles were provided by L. Symington (pSM444 and 
pSM438 respectively). To replace chromosomal MRE11 with mutant alleles, the 
plasmid containing mutations in the nuclease domains of MRE11 were linearized 
with SphI and transformed in the SK1 strain. Uracil-positive (Ura*) transfor- 
mants were inoculated in rich medium overnight and 20 ul of culture were spread 
onto medium containing 5-fluoroorotic acid (5-FOA) in order to select pop-out 
events. The presence of the mre11-D56N and mre11-H125N alleles in the resulting 
5-FOA resistant cells was assessed by sensitivity to DNA damaging agent: cells 
from single 5-FOA resistant colonies were patched onto plates containing high 
concentration of CPT (30-50 1M) supplemented with phloxin B. Because the 
introduction of the mre11-H59S mutation also altered a PmlI restriction site, 
the presence of the mre11-H59S allele was tested by sensitivity of PCR reactions 
covering the mutated region to Pmll restriction. Replacement of wild-type MRE11 
by targeted mutations was confirmed by PCR amplification and sequencing. 
Spot tests for DNA damage sensitivity. Cells were grown overnight in liquid 
YPD, diluted into fresh media and grown to log phase, adjusted to OD¢00 nm = 0.2, 
serially diluted tenfold, and 5 ul spotted onto control and drug-containing YPD 
plates. 

Spo11-oligonucleotide assays. Spol1-oligonucleotide complexes were detected 
by immunoprecipitation and end labelling following established methods'*. To 
reduce the co-precipitation of nonspecific genomic DNA, two rounds of immuno- 
precipitation were used. Specifically, 10-50 ml of sporulating culture was lysed in 
10% ice-cold TCA using zirconia beads and a BioSpec 24. Precipitated material 
was dissolved in STE (2% SDS, 0.5 M Tris pH 8.1, 10 mM EDTA, 0.05% bromo- 
phenol blue), and boiled for 5 min. Extracts were diluted twofold in 2X IP buffer 
(2% Triton X-100, 300 mM NaCl, 30 mM Tris-HCl pH 8.1, 2mM EDTA), cen- 
trifuged for 10min at 16,000g at 4°C, and supernatant was diluted a further 
twofold in 1x IP buffer. Anti-HA antibody (F-7; Santa Cruz Biotechnology) 
was added at 1 in 500, protein-G-agarose matrix (Roche) at 1 in 50, and then 
incubated with rotation for 4h at 4 °C. Immune complexes were collected by low 
speed centrifugation, and washed three times with 1x IP buffer. Beads were boiled 
for 5 min in 250 ul STE, chilled on ice, and diluted twofold as above with 2x IP 
buffer, recentrifuged, and supernatant was dissolved further twofold in 1x IP 
buffer. Fresh antibody and beads were added at the above dilutions. The second 
immunoprecipitation was performed overnight at 4 °C, and then washed as above. 
Two additional washes in 1X TKAc (20mM Tris-acetate pH 7.9, 50mM 
K-acetate) were performed before incubation with 10-20 units TdT (Fermentas) 
and 5-10 uCi CoTP (cordycepin triphosphate; Perkin Elmer) in 1x TKAc, 
0.25mM CoCl, buffer at 37°C for 1h. Labelled complexes were washed twice 
with 1X IP buffer, and eluted in Laemmli loading buffer for direct analysis on 
7.5% SDS-PAGE. For nucleotide resolution analysis of Spol1-oligonucleotide 
lengths, eluted complexes were mixed with 1 1g glycogen (Roche) and 10 volumes 
of 100% ethanol, and precipitated overnight at —20 °C. Precipitates were collected 
by centrifugation, dissolved in 15 pl of 10X TE containing 0.5 1g ml’ proteinase 
K, and incubated at 50 °C for 30 min. Eluted oligonucleotides were mixed with 3 
volumes of loading dye (95% formamide, 10mM EDTA, 0.01% bromophenol 
blue, 0.01% xylene cyanol), and fractionated through a 28-cm tall, 0.5-mm thick 
12% polyacrylamide (19:1), 6M urea gel in 1X TBE running buffer at approx 
1,200 V for 50-60 min. Gels were fixed in 10% methanol, 7% acetic acid and 5% 
glycerol, vacuum dried and exposed to phosphor screens for imaging. 

DSB and crossover analysis. Genomic DNA was isolated from aliquots of 
synchronously sporulating cultures using standard methods. Briefly, spheroplasts 
were prepared in 1 M sorbitol, 0.1 M EDTA, 0.1 M NaHPO, pH 7.5, 1% BME and 
200 pg ml! zymolyase 100T for 20-30 min at 37 °C, and lysed by adding SDS to 
0.5% and proteinase K to 200 jg ml‘ with incubation for 60 min at 60 °C. Protein 
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was removed by mixing with an equal volume of phenol:chloroform:isoamyl 
alcohol (25:24:1), and nucleic acids precipitated by adding one-tenth volume of 
3M NadAc pH 5.2 and an equal volume of 100% ethanol. Precipitates were washed 
in 70% ethanol and dissolved in 1X TE containing 100 1g ml! RNase, incubated 
for 60 min at 37 °C, reprecipitated with ethanol and NaAc and DNA pellets left to 
dissolve in 1X TE overnight at 4°C. Genomic DNA was digested with restriction 
enzymes following the manufacturer's recommendations, fractionated on 0.6%, 
0.7% or 0.9% agarose gels in 1X TAE or 1X TBE buffer and ethidium bromide (as 
required), and blotted under vacuum to zetaprobe membrane (BioRad) in 0.5 M 
NaOH, 1.5M NaCl. Blots were equilibrated in hybridization solution (0.5M 
NaHPO, pH 7.2, 7% SDS, 1mM EDTA, 1% BSA) at 65 °C, and random primed 
radioactive probes (High Prime; Roche) prepared from gel-purified PCR products 
were then added. Blots were washed four times in 40 mM NaHPO,, 1% SDS, 1 mM 
EDTA at 65°C, then dried and exposed to phosphor screens for imaging. 
Western blotting. TCA-denatured cell material dissolved in STE and 5% 
B-mercaptoethanol was fractionated in 7.5% or 10% SDS-PAGE, transferred to 
PVDF membrane (Millipore) in 1X CAPS buffer (10 mM CAPS-NaOH pH 11, 
10% methanol), and incubated in TBST + 3% BSA with the requisite antibodies: 
phosphorylated H2Ax was detected using Ab17353 diluted 1/5,000 in TBST 
(Abcam) and Hop] detected using anti-Hop1 antisera at a 1/4,000 dilution’’. 
DAPI staining. Cells were fixed in 100% methanol, and aliquots mixed with 1 pg 
ml! DAPI in 75% glycerol. The number of cells with 0, 2 and 4 nuclei was scored 
under a fluorescence microscope. At least 200 cells were counted per time point. 
Sporulation efficiency was determined by observation of fixed tetrads (24h into 
meiosis, on three independent time courses) under a Zeiss fluorescence microscope 
with both fluorescence and bright-field illumination. Samples were randomized 
and 150 to 300 cells were counted for each strain per time course. Sporulations were 
scored as abnormal when significant extranuclear material was detected by DAPI 
staining and the wall morphology of spores was abnormal under bright-field 
illumination. DAPI and DIC images were captured using a personalDV 
(DeltaVision) system (Applied Precision) using the softWoRx software. 
Chromatin enrichment. Cell pellets corresponding to 100 ml meiotic cultures 
were resuspended in 30 ml Sphero/CoHex buffer (1 M sorbitol, 50 mM HEPES pH 
7.5, 10mM EDTA, 5mM hexamine cobalt chloride) supplemented with 1% 
B-mercaptoethanol and zymolyase 100T to 50pgml~', and incubated for 
~25min at 37 °C. Spheroplasts were transferred onto 10 ml sucrose cushion, spun 
for 5 min at 16,000g at 4 °C. Spheroplast pellets were resuspended in 1 ml sucrose 
buffer (1M sucrose, 50mM MES-NaOH, 50mM NaCl, 1 mM EDTA, 0.5mM 
MgCl, Roche protease inhibitors cocktail, AEBSF to 10 pg ml), lysed in 30 ml 
lysis buffer (50 mM MES-NaOH, 1mM MgCl, 5mM EDTA, Roche protease 
inhibitors cocktail, AEBSF to 10 jig ml ~ ty. homogenized in a dounce homogenizer, 
and spun 5 min at 16,000g at 4 °C. Pellets were suspended in 30 ml lysis buffer and 
subcellular fractionation of cell contents was performed by centrifugation through 
a 10-ml sucrose cushion. Pellets were washed once in the same buffer, suspended in 
sucrose buffer, and 300 il aliquots were stored at —20 °C. Aliquots were incubated 
at 25 °C with DNase I (NEB) in 50 mM HEPES, 50 mM NaCl, 5 mM MgCl, buffer 
supplemented with complete protease inhibitors tablet (Roche) and AEBSF. 
Reactions were split and processed for genomic DNA extraction or Spoll- 
oligonucleotide purification. As a positive control, parallel reactions included a 
5'-labelled 55-base oligonucleotide (R1) that was detected on 1X TBE PAGE. 
Recombinant protein purification. GST-Mrell proteins were purified and 
reacted with DNA substrates as described’®. Briefly, MRE11 wild-type, mre11- 
H59S and mre11-D56N open reading frames (ORFs) were amplified by PCR, 
cloned with BamHI/Sall ends into pEG-KT plasmid and were transformed into 
the protease defective JEL1 mre11A strain (LSY1706; a gift from L. Symington). 
Galactose induction was performed as described in http://www.bio.brandeis.edu/ 
haberlab/jehsite/protocol.html. Cells from a 100 ml culture were collected by cent- 
rifugation for 5 min at 3,000g at 4 °C, washed in cold lysis buffer (20 mM Tris-HCl 
pH 8, 1 mM EDTA, 500 mM NaCl, 0.1% Triton X-100, 10% glycerol), and lysed in 
500 ll lysis buffer supplemented with 50 mM NaF, 10 pg ml! AEBSF and com- 
plete protease inhibitor tablet (Roche) using zirconia beads and a BioSpec 24 
apparatus. Lysates were cleared by centrifugation for 10 min at 16,000g at 4°C, 
and supernatants were incubated with 200 ul of Glutathione Sepharose beads 
slurry (pre-washed in lysis buffer) for 1h at 4°C. Beads were washed three times 
in lysis buffer and proteins were eluted over 30 min at 4°C in elution buffer 
(50mM Tris pH 8, 1mM EDTA, 20mM glutathione reduced, 40% glycerol, 
supplemented with protease inhibitors), and aliquots were frozen in liquid nitro- 
gen and stored at —80 °C. 

Oligonucleotide nicked substrate and exonuclease assay. The nicked, double- 
stranded substrate described in Fig. 2c was made by 5’-[°*P] end labelling of 
oligonucleotides F2: GACCTGGCACGTAGGACAGCATGGGATCTGGCCTG; 
and F3: TGTTACACAGTGCTACAGACAatggt (lowercase indicates 3’ over- 
hang) with PNK; and annealing to the unlabelled reverse complement, R1: 
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GTCTGTAGCACTGTGTAACACAGGCCAGATCCCATGCTGTCCTACGTGC 
CAGGT. Annealing was verified by migration on 1X TBE, 12% polyacrylamide 
gel. ~0.66 pmol of double-stranded nicked substrate was incubated with 
~0.13 pmol of recombinant protein in nuclease buffer (25mM MOPS pH 7, 
60mM KCl, 0.2% Triton X-100, 2mM DTT, 50mM MnCl,) for the indicated 
length of time at 37°C. Reactions were stopped by adding SDS to 0.3% and 
proteinase K to 0.5mgml * and incubated for 15 min at 37 °C. Reactions were 


denaturated in 2 volumes of formamide loading buffer (95% formamide, 10 mM 
EDTA, 0.005% bromophenol blue, 0.01% xylene cyanol) at 95 °C and separated on 
a 6M urea, 12% polyacryamide gel in 1X TBE. Gels were exposed to a phosphor 
screen for imaging. 


31. Panizza, S. et al. Spol1-accessory proteins link double-strand break sites to the 
chromosome axis in early meiotic recombination. Cel/ 146, 372-383 (2011). 
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Multiple routes to mammalian diversity 


Chris Venditti', Andrew Meade? & Mark Pagel”? 


The radiation of the mammals provides a 165-million-year test case 
for evolutionary theories of how species occupy and then fill eco- 
logical niches. It is widely assumed that species often diverge 
rapidly early in their evolution, and that this is followed by a 
longer, drawn-out period of slower evolutionary fine-tuning as 
natural selection fits organisms into an increasingly occupied 
niche space’”. But recent studies have hinted that the process 
may not be so simple**. Here we apply statistical methods that 
automatically detect temporal shifts in the rate of evolution 
through time to a comprehensive mammalian phylogeny’® and data 
set’ of body sizes of 3,185 extant species. Unexpectedly, the majority 
of mammal species, including two of the most speciose orders 
(Rodentia and Chiroptera), have no history of substantial and sus- 
tained increases in the rates of evolution. Instead, a subset of the 
mammals has experienced an explosive increase (between 10- and 
52-fold) in the rate of evolution along the single branch leading to 
the common ancestor of their monophyletic group (for example 
Chiroptera), followed by a quick return to lower or background 
levels. The remaining species are a taxonomically diverse assem- 
blage showing a significant, sustained increase or decrease in their 
rates of evolution. These results necessarily decouple morphological 
diversification from speciation and suggest that the processes that 
give rise to the morphological diversity of a class of animals are far 
more free to vary than previously considered. Niches do not seem to 
fill up, and diversity seems to arise whenever, wherever and at what- 
ever rate it is advantageous. 

Our approach uses a generalized least-squares model®*” of trait 
evolution in a Bayesian reversible-jump”’ framework that allows rates 
of evolution to vary in individual branches or entire monophyletic 
subgroups of a phylogeny (Supplementary Information). This allows 
us to trace the evolutionary history of shifts in the rate and timing of 
evolution without specifying in advance where these events are located, 
and to derive posterior probability density estimates of their magni- 
tudes and probability of occurrence (Supplementary Information). 
The null model states that evolution has proceeded at a constant rate 
throughout the class Mammalia. Applied to log-transformed body size 
data (n = 3,185 species) arrayed on the mammalian tree® , this model 
returns a Bayesian posterior density of log-likelihoods with a mean of 
— 939.34 + 0.99 (Fig. 1a), and a mean instantaneous rate of body size 
evolution of 1.02g per million years. If rates are allowed to vary 
throughout the tree, the posterior density improves to a mean log- 
likelihood of —364.13 + 23.01 (log(Bayes factor) = 993.51; values 
>10 considered ‘very strong’ support''; Fig. 1a). We detect evidence 
for a shift or change in the rate of evolution in approximately one- 
third, or 1,494 branches, of the tree, where to be included in this count 
branches had to either experience a change in rate in that branch or 
inherit that change from its immediate ancestral lineage, in at least 95% 
of the trees in the posterior sample. These shifts range from a 3-fold 
decrease to a 52-fold increase in the rate of evolution along a branch 
(Fig. 1b). 

It has long been believed that the radiation of extant mammals 
underwent a burst of body-size evolution that occurred early in its 
history and coincided with the appearance of the mammalian orders, 


and that this was followed by a gradual slowdown towards the pre- 
sent*’*"*, Explanations for this pattern suppose that mammals moved 
into a largely unoccupied niche and geographical space as they came to 
be the dominant vertebrate group on Earth. Then, as time went on, 
niche space and unexplored geographical regions became scarce, 
reducing opportunities for diversification’. In striking contrast to this 
picture, we do not find any evidence for either a generalized burst of 
evolution early in mammalian evolution or for the rates of evolution to 
decrease as time moves towards the present (Fig. 2a). Instead, rates of 
evolution were low and stable for about the first 60 million years, only 
starting to increase around 90 million years ago and then showing only 
about a twofold increase over the previous ‘baseline’ rate. This increase 
occurred before the origin of the present-day mammalian orders and is 
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Figure 1 | Log-likelihood of trait models when rates are allowed to vary. 

a, Posterior distribution of log-likelihoods from a model with equal rates of 
evolution (red), compared with the posterior distribution of log-likelihoods 
from the model in which evolutionary rates are allowed to vary (green): 
log(Bayes factor) = 993.51 (calculated from the log-harmonic means of the 
likelihoods); values >10 considered ‘very strong’ support. b, The coloured bars 
show distributions of rates for the one-third of the branches (1,494) for which 
the posterior probability of having a rate shift was greater than 0.95. Blue bars 
signify x-fold rate increases and yellow bars indicate x-fold rate decreases. Grey 
bars show the distribution of the mean fold rates for all the branches in the 
mammal phylogeny, independent of the level of posterior support. 
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Figure 2 | Rates of mammalian morphological evolution through time. 

a, Mean time-dependent rates of evolution for the mammalian radiation taken 
as a whole. b, Mean time-dependent rates of evolution within each mammalian 
order. The colour key matches that in the dated phylogeny in a (dates taken 
from the corrigendum to ref. 6), which has been collapsed to the level of order; 


in accordance with estimates generated using a different technique’. 
Thus, rates of evolution increased at a point corresponding to the early 
splitting of the Laurasiatheria and the Afrotheria, and were then largely 
maintained until the present (Fig. 2a). 

The time-dependent rates of change for the individual orders reveal 
a variety of patterns beginning around 80 million years ago (Fig. 2b). 
These tell three broad stories. The first is that there is no general 
tendency for orders to show bursts of evolution early in their radia- 
tions, followed by a gradual descent to baseline. In fact, in several 
orders—for example Primates, Eulipotyphla and Chiroptera—rates 
of change remain below the average mammalian rate (Fig. 2b). The 
second is that what might seem to be increases followed by decreases in 
a few orders, notably in Carnivora, Cetartiodactyla, Diprodontia and 
Rodentia, do not reflect order-wide processes. Rather, they are brought 
about by large changes in one or a small number of branches within 
those orders (Fig. 3 and Supplementary Figs 1-11). The third is that 
rates of body size evolution—and by implication many of the mor- 
phological and life history traits that allometrically scale with size’-— 
are decoupled from speciation. The slowly evolving Chiroptera 
account for ~1,000 species, and the rate of evolution in Rodentia— 
the most speciose mammalian group, accounting for roughly 60% of 
all extant species—has rarely exceeded the mammalian trend (calcu- 
lated rates account for the shared ancestry the tree implies and so are 
not biased by speciose clades). Instead, the highest average rates of 
change occur in one of the least speciose groups. The Proboscidea, 
some of whose members—the large elephants—are, along with the 
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the start of each triangle indicates the first split in that order. Orders shaded 
grey in the phylogeny have rates throughout their evolutionary history that fall 
within the grey bar in b. The mean rates presented in a and b are calculated 
taking in to account the shared ancestry as implied by the phylogeny (mean rate 
decreases are less than one). 


Sirenia, the closest living relatives to the small hyrax species, evolve 
on average 4.6-fold faster than the mammalian norm. 

Looking more closely within orders, we see no evidence at any 
phylogenetic level for an ‘early-burst’ pattern of evolution. Instead, 
natural selection seems capable of altering body size spontaneously 
and over short periods of time. Thus, we observe short-term explosive 
increases of between 10- and 52-fold in the rate of evolution, distrib- 
uted widely among clades (Fig. 3). It is these that shape the diversity of 
mammals, in striking ways. For example, the branches leading to 
Atelidae (woolly, spider and howler monkeys) record a 10-fold 
increase in the rate of evolution; the ‘big cats’ (the genera Panthera, 
Acinonyx, Uncia and Puma) jump to a 35-fold increase; Hominidae 
increase 12-fold; and Mysticeti increase 25-fold. The highest single- 
lineage burst, in excess of 52-fold, occurs in the branch leading to the 
genera Dasyurus (quolls) and Sarcophilus (Tasmanian Devils). We also 
observe stark differences in rate between closely related sister groups in 
many places across the mammal tree, including in the musk ox (Ovibos 
moschatus), which is much larger than its closest relatives, the gorals 
(Naemorhedus); and in the pygmy marmoset (Callithrix pygmaea), 
which is considerably smaller than its close relatives. The megabat 
genus Taphozous shows a drastic generalized decrease in the rate of 
evolution throughout the whole clade, whereas the genus Dasyurus 
and close relatives within the order Dasyuromorphia show a clear 
generalized increase (Supplementary Information). In a few cases, 
we observe a generalized increase in the rate of evolution throughout 
an entire clade (for example in Carnivora and Proboscidea). These 
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Figure 3 | The mammalian phylogenetic tree scaled to reflect morphological 
evolution. The branches of the phylogeny are transformed by the mean of the 
posterior distribution of the scalars acting on each branch—the branches of the 


might reflect trends towards increased size such as those described in 
some palaeontological and neontological studies*’*"®. 

An important subset of the short-term bursts occurs in the single 
evolutionary lineage that leads to the common ancestor of a large 
monophyletic group. These ‘single-lineage ancestral bursts’ seem to 
correspond to drastic changes in an animal from some ancestral state 
to a size that seems to form the basis for a new radiation, and might be 
akin to the concept of quantum evolution'*. For example, the rate of 
evolution in the branch leading to Chiroptera is 24-fold higher than 
what we expect from a constant-rate model. This suggests that the 
common ancestor of the bats descended rapidly from a considerably 
larger, perhaps carnivore-, artiodactyl- or cetartiodactyl-like, ancestor, 
but that once it reached the ‘bat’ size, rates of change proceeded at 
background or even lower levels, even while the number of bat species 
increased dramatically. Other single-lineage ancestral bursts seem to 
characterize the changes leading to several orders in Laurasiatheria 
and Afrotheria (Table 1 and Fig. 3). These rate changes and those 
we observe more generally throughout the tree are large but are in line 
with those for some genes, or for some morphological traits whose 
changes have been measured in real time’”~””. 

Conventional models that estimate a single homogeneous process 
over an entire clade—such as an early burst—can be easily misled by 


tree are stretched and compressed to reflect the rate of morphological evolution. 
Also, the branches are coloured according to how much they have been scaled 
(scale factor shown in colour bar). 


rate shifts that are confined to one or just a few branches. For example, 
recent studies have shown strong support for the early-burst model in 
primates* and cetaceans”. However, our results show that these bursts 
are attributable, in the case of primates, to two explosive increases along 
single branches (one leading to Atelidae and one to Hominidae; see 
inset in Fig. 3), and, in the cetaceans (the branch leading to Mysticeti; 
Supplementary Information), to one, rather than to a general trend. In 
other cases, a subgroup within a larger taxonomic range has increased 
its rate of change (for example the genera Rhinolophus in Chiroptera, 
Peroryctes in Peramelemorphia, and Dasyurus and Sarcophilus in 
Dasyuromorphia; Fig. 3 and Supplementary Information). This pattern 


Table 1 | Fold-increase rate of evolution for mammalian orders 
whose ancestral lineage is increased more than tenfold 


Order 


Single-lineage rate* Mean ratet in monophyletic 


group (s.d.) 
Cetartiodactyla 18.7 3.6 (3.2) 
Chiroptera 24.1 1.9 (1.8) 
Perissodactyla 17.1 1.1 (0.1) 
Proboscidea 14.4 4.6 (0.2) 
Sirenia 15.2 2.2 (0.2) 


* Rate increase in the single lineage leading to the corresponding order. 
+ Mean and s.d. of the of the rate increases along the branches within the corresponding order. 
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too has been wrongly assigned to a homogenous model, in this case to 
fitting an Ornstein—Uhlenbeck process across the entire order or clade 
(see, for example, ref. 4). It is possible that the early-burst pattern exists 
in our data but that our methods lack the power to detect it, but we 
think this unlikely for two reasons. One is that our methods can reliably 
detect twofold, or smaller, changes in the rates of evolution (Figs 1b 
and 2b; see also simulations in Supplementary Information). The 
second is that relaxing our statistical criteria to make it easier for 
changes to be identified does not change our conclusions (Supplemen- 
tary Information). If early-burst patterns do exist in the mammals, they 
are so small as to be of little evolutionary significance in comparison 
to the striking shifts in the patterns of diversification we observe dis- 
tributed throughout the tree. 

We infer historical evolutionary patterns in the rate of body size 
evolution using a phylogeny constructed from extant species*’. 
Incorporating fossils*’’** in our analysis could reveal details of the 
patterns of evolution leading to extinct species, and these could even 
differ from those for extant species. For example, the rate of evolution 
in extinct groups such as the triconodonts and multituberculates, 
which fall between the extant monotremes and the rest of the extant 
mammals, could have a considerably higher (or lower) rate of change 
than extant groups. However, we would not expect the historical 
trends we describe here for extant species to change qualitatively with 
the addition of extinct groups, even if those extinct groups did show 
different patterns of change; and we would not expect the inclusion of 
fossils to produce generalized early-burst patterns of change. The 
reason is that the methodology we use is specifically designed to 
account for the sort of inhomogeneity that would occur if an extinct 
group had a distinct rate of change. Thus, if such a fossil group were 
included, our method would detect the difference in rate and scale in 
that section of the tree and leave the rates along the remainder of the 
phylogeny unchanged. The mammalian supertree® we use in this 
study, although for the most part well resolved, does contain some 
polytomies. Polytomies can artificially decrease the rate of evolution’, 
but we do not believe that this greatly affects our results: the lack of 
resolution in the tree is mostly found at the tips, yet we find many 
instances of very high rate shifts in terminal branches. In fact, we find 
the most instances of high rates at the tips in the rodent clade, which 
contains the most polytomies. 

Our results reveal that natural selection has been a precise and 
flexible shaper of mammalian size diversity, able to produce rapid 
changes in size in specific parts of the tree, and using a variety of 
‘substrates’, be they elephants, carnivores, whales and even some 
rodents. Contrary to the long-held view that the diversity in mammal 
body sizes we see today is the product of widespread and homogeneous 
macroevolutionary processes in the rate of evolution, we find that 
natural selection has found multiple different routes to producing 
the current diversity of sizes. Our results also challenge the view that 
ecological niches fill up, as we find no suggestion of a generalized 
slowing in the rate of evolution as clades expand. This might suggest 
that ecological niches are a constantly ‘moving target’ and that they 
move just as much for speciose clades as for more depauperate ones. 
Combined with evidence that speciation rates might be constant in 
many groups of species, and that speciation might itself be the outcome 
of unusual single events”, our results indicate that to understand so- 
called adaptive radiations it is necessary to study the multiple events in 
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a species’ life that provide it with the opportunity to adapt, rather than 
studying wide and general processes. 
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Kelvin-Helmholtz instabilities as the source of 
inhomogeneous mixing in nova explosions 
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Classical novae’” are thermonuclear explosions in binary stellar 
systems containing a white dwarf accreting material from a close 
companion star. They repeatedly eject 10° *-10~° solar masses of 
nucleosynthetically enriched gas into the interstellar medium, 
recurring on intervals of decades to tens of millennia. They are 
probably the main sources** of Galactic '°N, '7O and °C. The 
origin of the large enhancements and inhomogeneous distribution 
of these species observed in high-resolution spectra’ of ejected 
nova shells has, however, remained unexplained for almost half a 
century®. Several mechanisms’, including mixing by diffusion’, 
shear? or resonant gravity waves’®, have been proposed in the 
framework of one-dimensional or two-dimensional simulations, 
but none has hitherto proven successful because convective mixing 
can only be modelled accurately in three dimensions. Here we 
report the results of a three-dimensional nuclear-hydrodynamic 
simulation of mixing at the core-envelope interface during nova 
outbursts. We show that buoyant fingering drives vortices from the 
Kelvin-Helmholtz instability, which inevitably enriches the accreted 
envelope with material from the outer white-dwarf core. Such mixing 
also naturally produces large-scale chemical inhomogeneities. Both 
the metallicity enhancement and the intrinsic dispersions in the 
abundances are consistent with the observed values. 

High-resolution spectra of nova shells, taken before the ejecta have 
undergone substantial modifications through interactions with the 
stellar companions’*”* or with interstellar medium, always show 
highly fragmented, chemically enriched and inhomogeneous shells. 
V1974 Cyg, for instance, although a neon nova, showed large—more 
than threefold—differences in the C/He abundance ratios between two 
knots resolved in the spectrum a few years after outburst’’. A similar 
abundance pattern has also been observed in many other novae, such 
as HR Del 1967 and DQ Her 1934, for which the shells are spatially 
resolved. Comparison of the infrared spectrum with the ultraviolet 
spectrum for the same ions at stages when the former, then the latter, 
turn optically thin shows the same structures to be present even during 
the previous opaque stages”, as do the multiple line systems known for 
decades from optical spectra’”>. 

The nova outbursts (whether involving He-rich, CO-rich or ONe- 
rich white dwarfs) are triggered by nuclear processes dominated by 
CNO-cycle reactions that produce '°N, '*!°O and '’F far in excess of 
solar abundances. Convection begins as soon as the temperature gra- 
dient becomes super-adiabatic, powered by the energy released from 
nuclear reactions (driven mainly by proton captures and 8" decays’, 
with a main nuclear path running close to the valley of stability), and is 
critical to the explosion, transferring a fraction of these abundant, 
short-lived species to the outer envelope layers. The energy released 
when these nuclei decay lifts degeneracy and drives the expansion, and 
ultimate ejection, of the polluted strata’®. Hence, any attempt to 
explain the peculiarities in the abundance pattern observed in nova 
ejecta requires an accurate model of the physical processes that occur 


during the explosion, namely the nuclear processes during the ther- 
monuclear runaway along with the convective mass and energy trans- 
port. The peak temperatures reached during a nova explosion are 
constrained by the chemical abundance pattern inferred from the 
ejecta and do not exceed 4 X 10°K, so it is unlikely that the observed 
metallicity enhancements can be due to thermonuclear processes 
driven by CNO breakout. Instead, mixing at the core-envelope inter- 
face is the more likely explanation. This cannot be modelled in the one- 
dimensional framework traditionally used in nova nucleosynthesis 
simulations, because mixing is inhibited by the one-dimensional 
mixing-length formalism of convection. 

Early attempts in two-dimensional simulations’? have shown that 
the onset of convection at the late stages of the thermonuclear runaway, 
driven by shear flows at the core-envelope interface, will ultimately 
dredge up chemically enriched material into the envelope. However, 
two-dimensional approximations for convection are unrealistic”*”*: 
the conservation of vorticity imposed by the two-dimensional geo- 
metry forces the small convective cells to merge into large eddies, with 
a size comparable to the pressure scale height of the envelope. In 
contrast, in three-dimensional fully developed turbulent convection, 
eddies will become unstable and consequently will break up and fila- 
ment, transferring their energy to progressively smaller scales**’’. 
These structures, vortices and filaments, must undergo a similar fate 
down to roughly the Kolmogorov scale, 1 = (v"/e)'”*, where v is the 
kinematic viscosity and ¢ is the energy dissipation rate. Until now there 
has been no indication of such a cascade in nova hydrodynamic simu- 
lations. We show that with sufficient resolution, proper treatment of 
the nuclear processing, and long time spans, this turbulent energy 
transfer occurs and solves the mixing problem. 

We therefore performed three-dimensional simulations of mixing 
at the core-envelope interface during classical nova explosions with 
the multidimensional, Eulerian, explicit code FLASH. The initial 
model’’ consists of a 1Mj CO white dwarf that accretes solar com- 
position matter (Z = 0.02) at a rate of 5 X 10 °Me yr. The model 
was evolved in one dimension and subsequently mapped onto a three- 
dimensional Cartesian grid of 800 < 800 < 800 km?’, when the tem- 
perature at the base of the envelope reached 10°K (see Fig. 1 and 
Supplementary Information). The model was relaxed to guarantee 
hydrostatic equilibrium. A top-hat 2-km-wide temperature perturba- 
tion®’ (5% amplitude) was imposed close to the core-envelope inter- 
face. As tested thoroughly in two dimensions, the specific choice of the 
initial perturbation (that is, duration, strength, location and size), the 
resolution adopted or the size of the computational domain does not 
have a severe impact on the results of the simulation. The initial per- 
turbation drives a shear flow triggering Kelvin-Helmholtz instabilities 
about 150s after the start of the simulation. Small convection cells 
develop as soon as material is dredged up into the envelope. Moreover, 
the fluid velocity remains below the speed of sound, confirming that a 
nova outburst is driven by a (subsonic) deflagration rather than a 
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Figure 1 | Mixing driven by Kelvin-Helmholtz instabilities. Snapshots of 
the development of Kelvin-Helmholtz instabilities at t= 151s (a), 193s 

(b), 296s (c) and 379s (d), shown in terms of the 1*C mass fraction on a 
logarithmic scale. Dredging of core material driven by Kelvin-Helmholtz 
instabilities translates into a mass-averaged abundance of CNO nuclei in the 
envelope of 0.118, 0.129, 0.157 and 0.182, respectively. The mean CNO 
abundance at the end of the simulation reaches 0.20 by mass (see 
Supplementary Movies). Calculations were performed with FLASH”, a 
parallelized explicit Eulerian code, based on the piecewise parabolic 
interpolation of physical quantities for solving the hydrodynamic equations, 
and with adaptive mesh refinement (with four or five levels of refinement). 
Simulations were run at the MareNostrum supercomputer, requiring 150,000 
CPU hours with 256 (occasionally 512) processors. The typical resolution 
adopted was 3.12 X 3.12 3.12 km°, with a maximum resolution of 

1.56 X 1.56 X 1.56 km’. The three-dimensional computational domain initially 
comprised 112 radial layers—including the outermost part of the CO core— 
and 512 horizontal layers along both horizontal axes. The mass of the accreted 
envelope was about 2 X 10° °Mo. Nuclear energy generation was handled 
through a network of 13 species (‘H, “He, 12130, 1314157 141516170) and 

U7 Ry. connected through 18 nuclear processes. Periodic conditions were 
imposed at the four vertical boundaries of the computational domain, and 
hydrostatic equilibrium with an outflow constraint at the top and a reflecting 
constraint at the bottom was imposed on the velocity at the horizontal 
boundaries”. Other details on the input physics are identical to those adopted 


in earlier two-dimensional simulations**”’. 


(supersonic) detonation. Even for a point-like ignition, the burning 
front quickly spreads horizontally, such that the expansion and pro- 
gress of the explosion proceed in almost spherical symmetry. This 
confirms early estimates” of the velocity of the deflagration front 
spreading through the stellar surface, in the form Vger~ (MpVconv/ 
Thurn) /» where hy is the pressure scale height, v-ony the characteristic 
convective velocity and Tpurn the characteristic timescale for fuel burn- 
ing. Typical values for nova outbursts yield vier = 10* cms; that is, a 
flame propagating halfway across the stellar surface in about 1.3 days. 

At t = 400s, matter crosses the outer computational boundary and 
we stop the calculations because of the Eulerian nature of the FLASH 
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code. At this stage, the envelope base has reached a peak temperature 
of 1.82 X 10°K, and the mean (mass-averaged) metallicity in the 
envelope reaches Z ~ 0.20. This agrees with observations of CO novae 
and with previous two-dimensional modelling”, suggesting that the 
dimensionality of the convective treatment is not crucial to the mean 
values but is important for the details (in particular for an accurate 
description of convective transport). The simulations reveal that these 
last stages are, however, intermittent, and our three-dimensional simu- 
lations seem to resolve at least the upper dissipation range (see 
Supplementary Movies). In that sense, the Kolmogorov scale, 17, pro- 
vides an estimate of the size of the smallest eddies present in the flow. 
Also at this stage, the Reynolds number becomes sufficiently small and 
molecular viscosity is effective in dissipating the kinetic energy into 
heat. 

In our simulations, the burning advances along with the develop- 
ment of persistent density contrasts of large size, comparable to the 
thickness of the layer and much larger than the burning transition 
zone. These become turbulent, and the models require a fully three- 
dimensional treatment to capture the full spectrum of the plumes and 
vortex structures. The resulting abundances from our simulations have 
another particular feature that agrees with and explains the observa- 
tions: the structures are chemically inhomogeneous. This is a relic of 
Rayleigh-Taylor instabilities that grow during the initial stages of the 
ejection. These structures appear within the burning zone (that is, the 
initial plumes) and drive a secondary Kelvin-Helmholtz vortex cascade 
that induces turbulent motions. In this regime of very high Reynolds 
number, the fluid motion is extremely complex and develops structures 
on all scales. In particular, vortices and filaments appear that are a 
signature of the intermittency in classical shear-flow turbulence. 
These are more evident in two-dimensional simulations, but they cascade 
rapidly into smaller eddies and filaments, subject to recombination and 
extension, as the burning continues. Such dissipation is intermittent, as 
predicted in the Kolmogorov-Obukov theory of turbulence*®’, 
generating coherent, persisting structures that advect with the expand- 
ing layers. Because the nuclear reaction rates are density sensitive, 
higher-density knots have a different nuclear history from that of the 
background, and this is best characterized by the abundance distri- 
bution function behind the deflagration. 

This can be clearly seen in Fig. 2, which shows a sample of the time 
evolution of the cumulant function. The initial abundance for °O, our 
trace species (whose abundance is increased by the deep non-uniform 
mixing after the onset of thermal buoyant turbulence), is a narrow 
initial distribution that evolves into a stable form with a lower cutoff 
and a power-law tail towards high abundances. Unlike the single value 
obtained in one-dimensional models, we find a 30% dispersion in the 
main component, fitted by a Gaussian (the dashed line in Fig. 2), and 
containing about 10% of the total volume, an extended, non-Gaussian 
‘fat tail’ whose maximum abundance (at the 1% level) extends up to 
130 from the mean '°O for the volume. Multiwavelength spectroscopic 
analyses during the nova nebular stage, when the ejecta are essentially 
transparent, frequently find large dispersions from line to line in the 
abundance ratios of the principal chemical species. Although usually 
assumed to result from measurement uncertainties, which are fre- 
quently lower than the derived dispersions, this may instead be a 
physically significant result: a signature of the turbulence generated 
during the thermonuclear runaway. 

Spatially resolved knots in several classical novae, notably V1974 Cyg 
(Nova Cyg 1992) demonstrate the reality of this inhomogeneity in the 
ejecta, up to threefold, long before the mixtures are diluted by inter- 
action with the interstellar medium. These were first seen in the first 
weeks of the expansion of the ejecta across different spectral regions, 
from the infrared to the ultraviolet, and were invariant in velocity and 
contrast as each layer was exposed, after the opacity drop in the expand- 
ing medium. The contrast between these residual structures will be 
further amplified by the supersonic motions that follow the stage 
shown here. Although some structuring may result from collision of 
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Figure 2 | Cumulant distributions. Cumulant distribution function for '°O 
(squares), at f= 190s (a), 240s (b), 315s (c) and 397s (d), compared with a 
Gaussian process (dashed line) with the same mean and dispersion. The 
cumulant distribution shows the probability that a variable x has a value equal 
to or lower than a threshold value a (that is, P[x = a]). The cumulant 
distribution is estimated by means of an arbitrary layer 100 km thick 

(5.75 X 10° to 5.85 X 10° cm, with 27,648 data points), a size comparable with 
the pressure scale height. The evolution shows the effects of the intermittent 


the ejecta with the accretion disk around the white dwarf and with the 
companion, the universality of the filamentation—independently of 
the parameters of the binary system and the characteristic times at 
which they are observed—points back to the first moments of the 
explosion. Many spatially resolved nova shells, notably GK Per 1901, 
HR Del 1967 and the recurrent nova T Pyx, provide ample evidence of 
the complexity of the filaments in late stages of the expansion. 

These fluctuations in abundance, density and, in turn, opacity will 
leave imprints not only in the ejecta but also in several aspects accom- 
panying nova outbursts. This will certainly influence the modelling of 
the initial nova outflows, of dust formation (which will condense more 
rapidly in the more '*C-enriched regions that survive within the ejecta) 
and of the maximum achievable luminosities during the explosion (for 
instance the extent to which classical nova explosions can exceed the 
Eddington luminosity and where this occurs). In addition, the long- 
standing problem of the high masses accumulated before the explosion 
and the subsequent discrepancy between the observed and computed 
ejecta masses is naturally resolved by deep mixing of the envelope 
material*®. A persistent problem in one-dimensional models has been 
the need for some dynamical premixing, either by shear instability or 
artificially enhanced convection, to match the observed abundance 
pattern; three-dimensional models, as shown in this paper, require 
neither. 
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Motor antagonism exposed by spatial 
segregation and timing of neurogenesis 


Marco Tripodi!, Anna E. Stepien! & Silvia Arber! 


Walking is a key motor behaviour of limbed animals, executed by contraction of functionally antagonistic muscle groups 
during swing and stance phases. Nevertheless, neuronal circuits regulating the activation of antagonistic extensor- 
flexor muscles remain poorly understood. Here we use monosynaptically restricted trans-synaptic viruses to 
elucidate premotor anatomical substrates for extensor-flexor control in mice. We observe a medio-lateral spatial 
segregation between extensor and flexor premotor interneurons in the dorsal spinal cord. These premotor 
interneuron populations are derived from common progenitor domains, but segregate by timing of neurogenesis. We 
find that proprioceptive sensory feedback from the periphery is targeted to medial extensor premotor populations and is 
required for extensor-specific connectivity profiles during development. Our findings provide evidence for a 
discriminating anatomical basis of antagonistic circuits at the level of premotor interneurons, and point to synaptic 
input and developmental ontogeny as key factors in the establishment of circuits regulating motor behavioural 


dichotomy. 


Walking represents a fundamental manifestation of motor behaviour 
and is based on the selective control of functionally antagonistic 
muscles. This seemingly simple behaviour has been the subject of 
scientific debate for more than a century’. An important entry point 
to understand differential regulation of motor output has been the 
comparative analysis of antagonistic motor neuron pool function. For 
example, extensor and flexor motor neuron pools in the mammalian 
spinal cord innervate distinct limb muscles, which are generally active 
in alternation between an ‘on-ground’ stance and an ‘off-ground’ 
swing phase during walking*. Developmental and genetic studies have 
addressed the role of neuronal populations derived from transcrip- 
tionally defined progenitor domains in motor output regulation, 
providing insight into the genetic identity of neurons required for 
left-right but not extensor-flexor regulation®*. 

Functional studies on the regulation of extensor and flexor motor 
neuron pools provide evidence for selective tuning of antagonistic 
circuits, most prominently featuring the concept of spinal central 
pattern generators and sensory feedback loops. Models emerging 
from this work describe distinct spinal ‘units’ orchestrating stance- 
swing phase alternation of motor output”””"’. Connection selectivity 
between functionally antagonistic circuit elements is also evident for 
sensory input processing to motor neurons and spinal inter- 
neurons*'*"'*. Group Ia proprioceptive sensory neurons transmit 
information from peripheral muscle spindles directly to motor neurons 
and Ia reciprocal interneurons in the ventral spinal cord with high 
synaptic specificity, obeying rules of functional antagonism*’””'. 
Even though these findings illustrate that functional antagonism can 
be observed at many levels, an overall anatomical assessment of 
organizational principles of neuronal circuits relaying information 
to functionally distinct motor neuron pools is currently lacking. 

Here we developed a retrograde reconstruction approach of motor 
output circuitry in the spinal cord to investigate the neuronal ensembles 
directly connecting to extensor and flexor motor neuron pools, the 
premotor interneuron populations. We exploited a monosynaptically 
restricted trans-synaptic rabies virus circuit tracing tool’*** recently 
developed for its use in the motor output system to study systematically 


the distribution of premotor populations”. We provide evidence for 
segregation of extensor and flexor premotor interneuron populations 
based on location, time of neurogenesis and synaptic input. These 
results provide a conceptual framework to understand the regulation 
of antagonistic motor behaviour at the circuit level. 


Spatial segregation of premotor interneurons 


To determine the overall spatial organization of spinal premotor 
interneurons connected to extensor and flexor motor neuron pools 
(Fig. la), we analysed their three-dimensional distribution in the 
lumbar spinal cord of mice by exploiting trans-synaptic virus-based 
methods™. To compare the distribution of premotor interneurons 
connected to motor neurons innervating the ankle extensor muscle 
gastrocnemius (GS) to the ones controlling the ankle flexor muscle 
tibialis anterior (TA) (Fig. 1a), we targeted injections of Rab-mCherry 
to the GS muscle and Rab-eGFP to the TA muscle in postnatal day 3 
(p3) mice, each in conjunction with an adeno-associated virus expres- 
sing the rabies glycoprotein (AAV-Gly). Premotor interneurons ana- 
lysed at p12 were found at spinal levels of the source motor neuron 
pools (lumbar (L) levels L4/5)”°, but also many segments rostrally and 
caudally to it (Fig. 1c, d). 

To analyse the three-dimensional distribution, we assigned x-y-z 
coordinates to labelled premotor interneurons (Fig. 1d; thoracic T11 
to sacral $1; n = 6). Comparison of the overall percentage of labelled 
interneurons did not reveal significant differences between ipsilateral 
and contralateral GS and TA premotor interneurons (Fig. 1b; per- 
centage contralateral: GS, 16.3 + 6; TA, 15.4 + 5). Dorso-ventral dis- 
tribution profiles showed the highest density of ipsilateral TA 
premotor interneurons in a slightly more ventral position than for 
GS (Fig. le), but these differences were not analysed further. We 
observed the most pronounced divergence in interneuron density 
along the medio-lateral dimension ipsilateral to injection, with the 
highest concentration of GS premotor interneurons located signifi- 
cantly more medially than for TA (Figs 1f-h and 2c), a feature inde- 
pendent of the developmental stage of viral injections (Supplementary 
Fig. 1). To generalize our findings, we analysed the distributions of 
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Figure 1 | Spatial segregation of extensor and flexor premotor 
interneurons. a, Scheme of hindlimb ankle and knee extensor (purple) and 
flexor (turquoise) muscles. BF, posterior biceps femoris; GS, gastrocnemius; 
TA, tibialis anterior; VL, vastus lateralis. b, Percentage of contralateral and 
ipsilateral GS/TA premotor interneurons. c, Rostro-caudal distribution of GS/ 
TA premotor interneuron density (T11-S1). The grey box indicates the motor 
neuron pool position. d, GS and TA premotor interneurons on transverse 
spinal cord section (left) and digital coordinates (right). Scale bar: 70 um. 

e, f, Dorso-ventral (e) and medio-lateral (f) density for GS/TA premotor 
interneurons. Top: average of n = 6 animals, Py_, < 0.05, Pn_1< 0.001, Wilcox 
test; bottom: 6 experiments. g-j, Transverse projections of digital 
reconstructions of individual experiments for GS, TA, VL and BF (g, i, dots 
indicate labelled neurons; h, j, density of neuronal distributions; scale in 
micrometres). 


premotor interneurons connected to two additional pairs of extensor 
and flexor motor neuron pools: those innervating the knee extensor 
vastus lateralis (VL) and knee flexor posterior biceps femoris (BF), 
and those innervating the hip extensor gluteus (GL) and hip flexor 
iliopsoas (IL) (Fig. 1a; see Methods). Analysis of these additional 
extensor-flexor premotor interneuron distributions also revealed a 
clear segregation in the highest density along the medio-lateral axis 
(Figs li, j and 2c and Supplementary Fig. 2). Taken together, these 
findings indicate that premotor interneuron density segregation along 
the medio-lateral axis is correlated with the extensor-flexor function 
rather than with the developmental origin of the innervated muscle 
(see Methods). 

Adductor muscles have a mechanical function distinct from 
extensor and flexor muscles to move a limb closer to the sagittal body 
plane, but the adductor muscles of the gracilis (GRA) group are 
recruited at a point that temporally overlaps with flexor muscle con- 
traction’®. Trans-synaptic virus injection into GRA revealed a premotor 
interneuron distribution pattern similar to flexor distributions (Fig. 2a- 
c), with the highest density of GRA premotor interneurons laterally and 
overlapping with the TA premotor interneuron domain (Fig. 2a). 

We next performed hierarchical cluster analysis of the distributions 
of all premotor interneuron ensembles analysed. We found that the 
flexor premotor interneuron populations TA/BF/IL co-segregate, 
together with the GRA group, whereas the extensor groups GS/VL/ 
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Figure 2 | Adductor and flexor premotor interneurons co-segregate. 

a, Transverse section density analysis of GS, TA and GRA premotor 
interneuron distributions (L2; scale in micrometres). b, Medio-lateral density of 
GS, TA and GRA premotor interneurons (L2; GRA versus TA, P > 0.05; GRA 
versus GS, P< 0.0001, Wilcox test). c, d, Medians of medio-lateral position 
(c; P< 0.0001, ANOVA; post-hoc analysis between all flexor-extensor pairs 
P<0.0001, extensor-extensor or flexor-flexor P > 0.05; Tukey’s HSD test) 
and cluster analysis (d; correlation index in arbitrary units; scale to the right) of 
premotor populations investigated. 


GL form a separate cluster (Fig. 2d). Our data therefore support the 
existence of a distinct anatomical representation of interneuron 
groups premotor to functionally different motor neuron pools. 


Dorsal dominance in spatial segregation 


To narrow down the cellular source(s) responsible for the observed 
medio-lateral distribution differences of premotor interneurons, we 
divided the spinal cord into distinct domains. Medio-lateral distri- 
bution differences between extensor and flexor premotor inter- 
neurons were only observed in the ipsilateral spinal cord (Fig. 1f 
and Supplementary Fig. 3). Analysis of medio-lateral interneuron 
segregation at individual rostro-caudal levels showed that differences 
between GS and TA were present throughout but most pronounced at 
rostral lumbar levels, whereas for the VL/BF pair, medio-lateral dis- 
tribution differences were smaller and only statistically significant at 
rostral levels (Supplementary Fig. 4). We next gated our analysis to 
premotor interneurons located dorsally or ventrally to the central 
canal, and found medio-lateral differences exclusively for inter- 
neurons with dorsal cell body position (Supplementary Fig. 5). 
Taken together, these findings reveal a clear medio-lateral segregation 
of extensor and flexor premotor interneurons in the ipsilateral dorsal 
spinal cord. 


Molecular profile of dorsal premotor interneurons 


During development, spinal interneurons emerge from distinct tran- 
scriptionally defined progenitor domains, generating six dorsally 
derived (dI1-6) and four ventrally derived (VO-3) broad interneuron 
classes*”’. dI4—-6 subclasses arise from intermediate spinal progenitor 
domains, express the homeodomain transcription factor Lbx1 and 
encompass both excitatory and inhibitory neurons*”’. Using develop- 
mental lineage tracing*’*' (Supplementary Fig. 6a), we found that the 
Lbx1 progenitor domain contributes to dorsal extensor and flexor 
premotor interneurons with a distribution bias towards extensor 
populations (Supplementary Fig. 6b; GS, 61%; TA, 31% of all dorsal), 
but medio-lateral density distributions were comparable to all dorsal 
premotor interneurons (Supplementary Fig. 6b). 
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Supporting the existence of inhibitory and excitatory premotor inter- 
neurons encompassed within the Lbx1-marked cohort, we found co- 
labelling of extensor and flexor premotor interneurons in GAD67“"” 
and GlyT2°"” transgenic mice”, as well as in GAD65“” Bac trans- 
genic mice with previously described connections between GFP°” neu- 
rons and motor neurons** (Supplementary Fig. 7). In addition, extensor 
and flexor premotor interneurons also encompass T1x3°" neurons 
(Supplementary Fig. 7), consistent with excitatory neurotransmitter 
fate**. Together, these findings show that the Lbx1 progenitor domain 
gives rise to dorsal premotor interneurons with an extensor versus 
flexor bias, but alone does not offer a clean distinguishing arbiter 
between these subgroups, raising the question of other sources for 
distinction. 


Timing of neurogenesis of premotor interneurons 


Birth-dating studies have provided evidence for a temporal neuro- 
genesis gradient in the spinal cord, with the ventral-most neurons 
generated before dorsally derived populations**. Lbx1-derived neurons 
are generated during two waves’, raising the question of whether 
dorsal extensor and flexor premotor interneurons derived from this 
domain are generated at different developmental stages. Pulse-chase 
experiments with two birth-dating tags, sequentially injecting BrdU 
(e10.5) and EdU (e12.5), revealed clear differences along both dorso- 
ventral*® and medio-lateral axes in p12 spinal cords, with neurons 
generated at e10.5 taking on a more ventral and lateral position than 
neurons generated at e12.5 (Fig. 3a and Supplementary Fig. 8). 
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Figure 3 | Timing of neurogenesis separates extensor and flexor 
populations. a, p12 lumbar spinal cord pulsed with BrdU (e10.5) and EdU 
(E12.5) (left), transverse section density analysis (middle; dorsal horn 
excluded), and dorso-ventral and medio-lateral density analysis (right; dorsal 
horn excluded). b, Time of neurogenesis for TA, BF, GS and VL premotor 
interneurons (e9.5-e13.5 daily intervals; +s.d.). c, d, Neurogenesis analysis for 
GS (c) and TA (d) premotor interneurons with e10.5 (turquoise) and e12.5 
(purple) birth-dating tags. Left: BrdU or EdU°"/rabies*" neurons represented as 
single dots (top right inset shows fractional distributions of dorsal premotor 
interneurons at e10.5 versus e12.5; bottom right inset shows density analysis of 
neuronal distributions). Right: representative examples for e12.5 (c) or e10.5 
(d) pulse together with viral labelling of GS (c, purple) or TA (d, turquoise) 
premotor interneurons (left panels show high magnification of birth-dating 
tags). Scale bar: a, 70 um; ¢, d, 10 pm left, 30 pum right. 
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We next gated the birth-dating tags to either extensor or flexor 
premotor interneurons. We found enrichment for flexor premotor 
interneurons (Fig. 3d and Supplementary Fig. 9; TA, 43 + 9%; BF, 
48 + 7%) compared to extensor premotor interneurons generated at 
e10.5 (Fig. 3c and Supplementary Fig. 9; GS, 8 + 3%; VL, 11 + 3%). In 
contrast, a selective gate to the e12.5 birth-dating tag revealed a bias 
for extensor premotor interneurons (Fig. 3c and Supplementary Fig. 
9; GS, 23 + 7%; VL, 26 + 5%) compared to flexor populations (Fig. 3d 
and Supplementary Fig. 9; TA, 4+ 2%; BF, 4+ 1%). Injections at 
daily intervals (e9.5-e13.5) confirmed the separate neurogenesis 
peaks for flexor and extensor premotor interneurons (Fig. 3b). 

We next determined whether similar birth-dating order rules apply to 
neurons derived from defined progenitor domains. We found that 
e10.5-generated Lbx1-derived flexor premotor interneurons outnumber 
their extensor counterparts (e10.5 TA, 23%; e10.5 GS, 4% of all LacZ°"/ 
rabies*” neurons), and that e12.5-generated Lbx1-derived extensor 
premotor interneurons dominate over flexor subsets (e12.5 TA, 8%; 
e12.5 GS, 29% of all LacZ°"/rabies®” neurons). These findings provide 
evidence for a distinction between extensor and flexor premotor inter- 
neuron populations generated from the same progenitor domain 
territory based on timing of neurogenesis. 


Proprioceptors target extensor premotor interneurons 


Spatial segregation of extensor and flexor premotor interneurons 
offers an opportunity for regulation by distinct synaptic sources. 
Proprioceptor-driven disynaptic connections from intermediate 
spinal neurons to extensor motor neurons are well documented by 
electrophysiology*’ *°. Because this area exhibits a seemingly similar 
position to the highest extensor premotor interneuron density, we 
assessed a possible overlap by gating our analysis to this domain. 
Proprioceptive afferents exhibit a major termination zone in 
Rexed’s lamina VI, and synaptophysin-GFP (SynGFP) accumulation 
in proprioceptive terminals shows co-labelling of SynGFP and vGlut1 
(>90%; Fig. 4a and Methods). We found that 56 + 6% of all virus- 
labelled dorsal GS premotor interneuron cell bodies, but only 13 + 8% 
of the dorsal TA population, resided in this area (Fig. 4b, c; L1-L3 
level), and a similar divergence was observed for VL/BF premotor 
interneurons (VL, 58 + 9%; BF, 22 + 2%; Fig. 4c). In agreement with 
the observed co-segregation of GRA and flexor premotor interneuron 
distributions, only 23 + 13% of the dorsal GRA premotor interneuron 
cell bodies resided within the proprioceptive gate (Fig. 4c). In addition 
to their medial enrichment, dorsal extensor premotor interneurons 
also showed diminished accumulation in the domain lateral to the 
proprioceptive gate when compared to flexor populations (2.6-fold 
flexor enrichment; P<0.05, two-tailed unpaired t-test). High- 
resolution image analysis demonstrated that both extensor and flexor 
premotor interneurons resident within the proprioceptor gate are 
targeted directly by vGlutl™ terminals (Fig. 4d; appositions per 
100 uum: GS, 3.0 = 1.2; TA, 2.6 + 0.7), but at values an order of mag- 
nitude higher than in the lateral domain (appositions per 100 jum”: 
GS, 0.2 + 0.2; TA, 0.24 + 0.1). Together, these findings show that 
medio-dorsally located extensor premotor interneurons co-segregate 
with the intermediate spinal cord proprioceptor termination area, 
whereas most of the flexor and GRA premotor interneuron cell bodies 
are positioned more laterally. 


Proprioceptor ablation perturbs connectivity 

To address the question of whether proprioceptive afferents themselves 
are involved in controlling differentiation and/or connectivity of dorsal 
extensor and flexor premotor interneurons, we made use of a genetic 
strategy ablating proprioceptors by diphtheria toxin (PV?" mice)". 
We found that molecular identity and neuronal settling pattern related 
to developmental ontogeny was not perturbed in PV?’ mice 
(Supplementary Fig. 10). Medio-lateral distribution profiles of TA pre- 
motor interneurons in PV?" mice revealed no significant differences 
to wild type, but marked changes were observed for GS premotor 
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Figure 4 | Dorsal extensor premotor interneurons reside in proprioceptive 
termination area. a, Proprioceptive afferent (PA) termination zones in p12 
PV?/ Tayo steP-lox-SynGFP Va mbar spinal cord (motor neurons (MNs): 
ChAT"). SynGFP marks PV°” proprioceptor terminals (left) co-localizing 
with vGlutl (right; high resolution). Diagram (right): proprioceptor 
termination zones in intermediate (yellow) and ventral (grey; MNs) spinal cord 
(dorsal horn terminating afferents: vGlut1°"/SynGFP™). b, GS and TA 
premotor interneuron analysis gated to intermediate spinal cord vGlut®" PA 
termination zone (white). The bottom panel shows medio-lateral density 
distribution of GS/TA premotor interneurons (grey, medio-lateral extent of 
vGlut1°" termination zone). c, Percentage of proprioceptor-embedded dorsal 
GS, VL, TA, BF and GRA premotor interneurons (P < 0.0001, ANOVA; post- 
hoc analysis between all pairs of flexor—-extensor P < 0.0001, extensor-extensor 
or flexor-flexor P > 0.05; Tukey’s HSD test). d, Synaptic contacts between 
vGlut1°" terminals (yellow: reconstructed appositions) and GS premotor 
interneuron. Inset: IMARIS analysis (turquoise, vGlut1; white, apposition 
vGlut1/GS premotor interneuron). Scale bars: a, 70 jim (left), 4 um (right); 

b, 20 tum; d, 4 um (inset, 0.5 jim). 


interneurons (Fig. 5a, b, d, f). Whereas the highest overall density of GS 
premotor interneurons in wild type was found medially, a pronounced 
lateral shift was observed in PV" mice, resulting in a density distri- 
bution profile indistinguishable from flexor (Fig. 5d, f). Unbiased 
monitoring of GS and TA populations by cluster analysis revealed that 
the PV?™ GS population co-segregates with the wild-type TA rather 
than GS population (Fig. 5g), a phenotype evident throughout the 
rostro-caudal extent for both GS/TA and VL/BF pairs (Supplemen- 
tary Fig. 11). By application of digital masks to the area normally 
occupied by proprioceptive afferents in wild-type mice, we found that 
only 6 + 4% of all dorsal GS interneurons in PV” mice were located 
in this now proprioceptor-stripped domain, compared to 56 + 6% in 
wild type (Fig. 5e). 

Group Ia proprioceptive afferents also establish synaptic contacts 
with motor neurons, in addition to their contacts in the intermediate 
spinal cord. To determine whether contact by proprioceptors to the 
intermediate spinal cord is sufficient to implement motoneuronal 
connections by extensor premotor interneurons, we analysed Er81 
mutant mice, which have proprioceptive connectivity to intermediate 
spinal cord but not motor neurons*. Er81 mutant mice revealed no 
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Figure 5 | Proprioceptors control connectivity of extensor premotor 
interneurons. a-d, Transverse (a-c) and medio-lateral (d) density analysis of 
GS and TA premotor interneurons in wild-type (a, d), pyeis (b, d) and 
Er81~'~ (c, d) mice. Scale in micrometres. e, Percentage of dorsal GS/TA 
premotor interneuron cell bodies within proprioceptive afferent (PA) 
termination area (P< 0.0001, ANOVA; post-hoc analysis; GS: PV?" versus 
wild type P < 0.0001, Er81 ~~ versus wild type P> 0.6; TA: PVP" versus wild 
type P> 0.3, Er81~'~ versus wild type P > 0.7, Tukey’s HSD test). f, g, Medio- 
lateral position (f; P< 0.0001, ANOVA; post-hoc analysis, GS: PVv??4 versus 
wild type P< 0.0001, Er81-/~ versus wild-type P > 0.8; TA: PV?" versus 
wild-type P> 0.9, Er81-'~ versus wild-type P > 0.9, Tukey’s HSD test) and 
cluster analysis of median distributions (g; correlation index in arbitrary units; 
scale to the right) of GS/TA premotor interneurons in wild-type, PV?’ and 
Er81~'~ mice. 


significant difference in the medio-lateral density distribution profiles 
of GS extensor premotor interneurons compared to wild-type mice 
(Fig. 5a, c, d, f, g and Supplementary Fig. 11). Together, these findings 
demonstrate that the establishment of direct connections between 
presumptive extensor ‘premotor’ interneurons in the medio-dorsal 
spinal cord is dependent on the presence of proprioceptors in the 
intermediate spinal cord. 


Discussion 


Motor antagonism must be reflected at the neuronal circuit level, 
supporting initiation, propagation and alternation of movement 
based on the changing needs of the system. Our study provides insight 
into the organizational principles of motor output circuitry at the 
global premotor level, just one synapse away from motor neurons. 
This approach has the previously unmet potential to lay open differ- 
ential anatomical three-dimensional connectivity matrices to func- 
tionally defined motor neuron pools, and exposes premotor network 
organization indiscriminate of functional criteria such as neuronal 
activity patterns. More refined analysis of premotor cohorts will 
reveal which neuronal subpopulations can be assigned to specific 
expressions of motor antagonism or other motor circuit modules. 
For instance, although rhythmically active interneurons can be 
observed in the intermediate and ventral spinal cord in adult cats** 
and motor burst generation in neonatal rodents is thought to occur 
ventrally”, the relationship between the premotor network described 
here and antagonistic central pattern generator modules*'®"! awaits 
future investigations. 

We found the most marked organizational differences between 
overall extensor and flexor premotor distributions in the intermediate 
spinal cord with pronounced sensory input, indicating that spatial 
organization here may be important for the function of premotor 
circuits involved in sensory-regulated antagonism. Proprioceptive 
feedback provides an essential arbiter intersecting with spinal circuits 
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to adjust motor output'*. Interestingly, a disynaptic circuit acting on 
lamina V/VI interneurons switching between inhibition at rest and 
excitation during extension has been proposed based on functional 
studies in adult cat*”~*°. Our findings on dorso-medial extensor pre- 
motor interneurons provide a possible anatomical substrate for these 
last-order interneurons as targets for more distant circuit com- 
ponents. They reveal the existence of an evolutionarily conserved 
disynaptic circuit in mammals, in which proprioceptors act as a ‘private 
synaptic channel’ to spatially segregated premotor interneuron popu- 
lations. During development, proprioceptive afferents control mono- 
synaptic connectivity between cognate dorsal extensor premotor 
interneurons and motor neurons. Whether other synaptic sources such 
as defined descending pathways have analogous roles in providing 
input to and influencing connection specificity of lateral extensor 
and flexor populations—imposed by spatial organizational con- 
straints—is an interesting question to pursue. 

Focusing on premotor spinal interneurons, we uncover a spatial and 
developmental motor map at the level of extensor-flexor premotor 
circuits. Our findings provide support for an anatomy-based organ- 
ization of premotor circuits, together with the observation that circuits 
in larval zebrafish are anatomically tuned to motor function during 
development with correlation between dorso-ventral topographic cell 
body organization and movements at changing speed*®. We found that 
the medio-lateral extensor—flexor premotor interneuron segregation in 
the dorsal spinal cord has its origin in temporally divergent neuronal 
generation. Specification of temporal neuronal identity is best studied 
in Drosophila, where neuroblast generation in a temporal sequence is 
accompanied by transcription factor cascades*’”. In mammals, a cor- 
relation between temporal identity and connectivity has been observed 
in cerebellar, hippocampal and olfactory circuits** °°, but whether and 
how these differences relate to function and behaviour remains 
unclear. Previous developmental and genetic studies aimed at under- 
standing circuits underlying functional antagonism in mice were largely 
based on the dorso-ventral progenitor domain origin of neuronal popu- 
lations®’. Our study suggests that temporal identity contributes an 
essential parameter to the identification of the circuit basis underlying 
functional dichotomy in the mammalian spinal cord and provides 
insight into the organizational principles of neuronal circuits control- 
ling motor behaviour. 


METHODS SUMMARY 


For the generation of Tau'?™P!"9"GFP mice, a lox-stop-lox-synaptophysin- 


GFP-Nls-lacZ-pA targeting cassette was integrated into exon 2 of the Tau locus 
as previously described*’. Other mouse strains and intercrosses are described in 
Methods. Virus production and injections were performed as described**’. 
Images were acquired with an Olympus confocal (FV1000) or a custom-made 
dual spinning-disk microscope”. Proprioceptive gates were assigned based on 
vGlutl analysis in Matlab, where a pixel intensity-based mask was assigned to 
each acquired image to delimitate the gate borders. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Generation of mice and mouse genetics. PV (ref. 31), Tau 
(ref. 31), Lbx1°” (ref. 30), GAD65“"” (ref. 34), GAD67"” (ref. 33), GlyT20"” 
(ref. 32), Is2?" (ref. 51) and Er81 mutant” mouse strains have been described 
previously. Intersectional breeding of PV™ (ref. 31) and Ts12"™ (ref. 51) mice was 
used to selectively ablate proprioceptors in this study. At early postnatal stages, 
these mice lack vGlut1" signal in the intermediate and ventral spinal cord*!”* as 
well as peripheral end organs normally innervated by proprioceptors”, demon- 
strating the complete absence of proprioceptive afferents. Wild-type mice and 
intercrosses were maintained on a mixed genetic background (129/C57BL/6), 
and all animal experiments were performed according to Swiss guidelines. 
Virus tracing experiments and timing of neurogenesis. The approach used in 
this study makes use of an attenuated rabies virus expressing fluorescent marker 
proteins (mCherry or eGFP) exhibiting a selective deletion in a glycoprotein (Gly) 
essential for trans-synaptic spreading’*”’. Targeted co-injection of AGly-rabies 
virus (Rab-mCherry or Rab-eGFP) and an adeno-associated virus expressing Gly 
(AAV-Gly) into limb muscles results in retrograde infection and labelling of 
premotor neurons in addition to spinal motor neuron pools innervating the 
injected muscles. Motor neurons are located in Rexed’s lamina IX and can be 
distinguished from interneurons on the basis of choline acetyl-transferase 
(ChAT) expression”. Rab-mCherry, Rab-GFP and AAV-Gly viruses were pro- 
duced and titrated as described**. Muscles were identified and assigned according 
to ref. 53, and injection specificity for every experiment was confirmed using 
fluorescence upon humanely killing the animals. 

BrdU (Sigma)/EdU (Invitrogen) were injected at 100mgkg * in pregnant 
female mice at the desired embryonic stages. For BrdU experiments, mice were 
perfusion fixed and subsequently immersion fixed for 4h in 4% PFA/0.8% picric 
acid. Tissue-Tek OCT embedded spinal cords were cut at 40 im using a cryostat. 
Floating sections were incubated for 30 min at 45 °C in 1 M HCL, briefly washed in 
PBS, incubated for 10min in borate buffer pH 8.0 followed by another PBS 
washing step. After this treatment, EdU was revealed as described™ (Invitrogen 
Click-it EdU). After EdU staining, mouse anti-BrdU was added over night at 4 °C 
and revealed with fluorescent-conjugated secondary antibodies (Invitrogen). 
Rationale for choice of muscles for premotor interneuron analysis. The fol- 
lowing muscles were chosen for analysis: tibialis anterior (TA), posterior biceps 
femoris (BF), iliopsoas (IL), gastrocnemius (GS), vastus lateralis (VL), gluteus 
(GL) and gracilis (GRA). These muscles were selected based on the following 
criteria: (1) the mechanical actions of the muscles on joints, taking into account 
the anatomical muscle attachment sites through tendons; (2) the reflex response 
patterns as defined previously’, providing a comprehensive reflex analysis in the 
cat, and defining flexor muscles as activated and extensor muscles as unaffected or 
inhibited by flexor reflex afferent stimulation; (3) the phase of muscle recruitment 
during forward locomotion, based on EMG data in in vivo preparations, pref- 
erentially in mice or rats, but referring to cat data when such data are not available 
from mice or rats. As defined in our study, extensor muscles (GS, VL, GL) act 
mechanically as extensors, are inhibited or unaffected during the flexion reflex 
and are recruited during stance phase in forward locomotion in vivo, whereas 
flexor muscles (TA, BF, IL) act mechanically as flexors, are activated during the 
flexion reflex and are recruited during swing phase in forward locomotion in vivo. 

In more detail, GS acts as an ankle extensor, is recruited during stance in mice 
in vivo” and is inhibited during the flexion reflex in cat’. VL acts as a knee 
extensor, is recruited during stance in mice in vivo”, and is inhibited during 
the flexion reflex in cat*. GL acts as a hip extensor, is recruited during stance 
phase in forward locomotion” and is unaffected by the flexion reflex in cat*. TA 
acts as ankle flexor, is recruited during swing phase in forward locomotion in 
mice in vivo” and is excited during the flexion reflex in cat*. The choice of BF is 
more complex owing to the bi-functional nature of this muscle, but its choice was 
obligatory due to the absence of pure knee flexor muscles in mice. For this reason, 
we chose to inject the posterior head of the BF, which inserts to the distal shank, 
and as a consequence, this head has a prominent mechanical role in knee flexion, 
with a hip-to-knee lever arm ratio of 3:5 (ref. 57) Consistent with the more 
prominent role of this head in knee flexion, it is active during swing” and it is 
excited during the flexion reflex*. IL acts as hip flexor, is recruited during swing 
phase in forward locomotion in vivo’*”* and is excited during the flexion reflex’. 
GRA was chosen as an example of a muscle with a more complex function. Albeit 
acting mechanically as knee flexor, it also acts as an adductor of the thigh’. GRA 
therefore shows a mechanical function distinct from all other muscles examined 
in this study. GRA is consistently recruited during swing phase in forward loco- 
motion in vivo*® and is excited during the flexion reflex’, but a more complex 
reflex response pattern has also been reported”®. With respect to locomotor cycle 
recruitment phase, BF and GRA have also been suggested to form a separate 
group as retractors, muscles active to complete the backward movement of foot 
and toes and to lift the toes”. 
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Our analysis and choice of muscles also included a component of assessing the 
developmental origin of a limb muscle and columnar identity of motor neurons 
innervating the muscle. Genetic programmes guide the establishment of axonal 
trajectories of motor neurons residing in the medial and lateral aspect of the lateral 
motor column (LMCm, ventral limb projections; LMCI, dorsal limb projections) to 
muscles of distinct developmental origin”. For instance, GS and BF are muscles of 
ventral developmental origin and are innervated by LMCm motor neurons, 
whereas TA and VL are muscles of dorsal origin and innervated by LMC] motor 
neurons**”. The study of premotor interneurons connected to these motor neuron 
pools therefore also allows addressing whether the observed density segregation is 
linked to the extensor and flexor function or the developmental origin of the muscle. 
Immunohistochemistry and imaging. Antibodies used in this study were: chicken 
anti-GFP (Invitrogen), guinea-pig anti-Tlx3 (ref. 61) (gift from C. Birchmeier, 
Berlin), guinea-pig anti-vGlutl (Chemicon), goat anti-ChAT (Chemicon), goat 
anti-LacZ (Biogenesis), mouse anti-BrdU (Invitrogen, MoBU-1 clone), rabbit anti- 
GFP (Invitrogen), rabbit anti-LacZ (Invitrogen), rabbit anti-PV (Swant) and rabbit 
anti-RFP (Chemicon). Spinal cords were sectioned and processed for immuno- 
histochemistry as previously described’ with the exception of BrdU experiments 
(see above). 

In the ventral spinal cord, vesicular glutamate transporter 1 (vGlut1) identifies 
synaptic terminals of proprioceptors”. Mechanoreceptive sensory neurons with 
deep dorsal horn projections also express vGlutl (ref. 63), prompting us to 
determine whether vGlut1 faithfully marks proprioceptive terminals in the inter- 
mediate spinal cord proprioceptor termination area. Expression of the fusion 
protein synaptophysin-GFP (Syn-GFP) in proprioceptors of PV”*/ 
Tau sole S"GEP mice highlights the proprioceptor termination area in the 
intermediate spinal cord, in which >90% of terminals were co-labelled for 
SynGFP and vGlutl at p12 (Fig. 5a). 

Synaptic input (vGlut1°") to dorsal GS and TA premotor interneurons was 
reconstructed using IMARIS software, for neuronal populations residing inside 
or outside the proprioceptive gate. Images were deconvolved using Huygens. 
Neuronal surfaces were reconstructed using the IMARIS surface tool, including 
the unambiguously assigned dendrites, before the overall surface area was calcu- 
lated for each reconstructed neuron. Appositions between vGlut1 terminals and 
premotor interneurons were determined by co-localization analysis in IMARIS, 
and resulting assigned synaptic contacts were extracted and counted in IMARIS 
using the Spots function. Values in the text express the number of appositions per 
100 um? of reconstructed neuronal surface. For each sampled class (TAinside 
TAcutside GSinsidee GSoutside)» Synaptic input to =30 neurons was determined. 
Statistics. The custom-made MATLAB plug-in ‘Reference Axes’ was developed to 
reconstruct the position of virally labelled neurons in a three-dimensional Cartesian 
space, allowing the comparison of multiple specimens. The plug-in runs in the 
previously developed MATLAB-based image processing suite ‘Qu’. In the 
Reference axes plug-in, a set of Cartesian axes is constructed interactively over all 
frames of a z-series with the zero centred on the central canal, the y axis parallel to 
the midline of the spinal cord and the x axis orthogonal to it. The position of virally 
labelled interneurons detected by means of Qu’s spot detection algorithm is trans- 
formed (translation and rotation) to a common coordinate system. Corrected spot 
positions can be further processed in Qu or exported as comma-separated value 
files. All statistical analysis and data set visualization were performed using R (R 
Foundation for Statistical Computing, http://www.r-project.org). One-dimensional 
kernel density estimates were obtained using the R ‘density’ function. Two- 
dimensional kernel density estimation used to compute the distribution contours 
was obtained using the ‘kde2d’ function provided in the ‘MASS’ library. Two- 
dimensional kernel density estimation were graphically displayed as contour plots, 
with the contour lines connecting points of equal densities and drawn for density 
values between 30% and 100% of the estimated density range, in steps of 10%. 
Correlation matrices were computed as a function of the distances between distri- 
bution medians. For all box plots shown, the horizontal bar inside the box indicates 
the median value, box limits are set at the 25th and 75th percentile of the distri- 
bution, whiskers extend to accommodate values up to 1.5 times the interquartile 
range, values outside this range are plotted as single points. For linear density 
distributions, integral of curve always equals 1, and figure panels do not display 
y-axis values. To determine the relative ratio of extensor to flexor premotor inter- 
neurons in the dorsal spinal cord lateral to the proprioceptive gate, connectivity 
indices were normalized to the number of contralateral premotor interneurons, a 
population without any obvious extensor-flexor bias. Flexor premotor inter- 
neurons were significantly enriched (2.6%) when compared to their extensor 
counterparts (GS, 2.4+1.3 x GScontra; TA, 6.2+ 1.2 TAcontra; P< 0.05, 
two-tailed unpaired t-test) in wild-type mice. 
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So far, no common environmental and/or phenotypic factor has 
been associated with melanoma and renal cell carcinoma (RCC). 
The known risk factors for melanoma include sun exposure, pig- 
mentation and nevus phenotypes’; risk factors associated with 
RCC include smoking, obesity and hypertension’. A recent study 
of coexisting melanoma and RCC in the same patients supports a 
genetic predisposition underlying the association between these 
two cancers*. The microphthalmia-associated transcription factor 
(MITF) has been proposed to act as a melanoma oncogene’; it also 
stimulates the transcription of hypoxia inducible factor’ (HIF1A), 
the pathway of which is targeted by kidney cancer susceptibility 
genes®. We therefore proposed that MITF might have a role in 
conferring a genetic predisposition to co-occurring melanoma 
and RCC. Here we identify a germline missense substitution in 
MITF (Mi-E318K) that occurred at a significantly higher frequency 
in genetically enriched patients affected with melanoma, RCC or 
both cancers, when compared with controls. Overall, Mi-E318K 
carriers had a higher than fivefold increased risk of developing 
melanoma, RCC or both cancers. Codon 318 is located in a 
small-ubiquitin-like modifier (SUMO) consensus site (‘YKXE) 
and Mi-E318K severely impaired SUMOylation of MITF. Mi- 
E318K enhanced MITF protein binding to the HIF1A promoter 
and increased its transcriptional activity compared to wild-type 
MITF. Further, we observed a global increase in Mi-E318K- 
occupied loci. In an RCC cell line, gene expression profiling 
identified a Mi-E318K signature related to cell growth, prolifera- 
tion and inflammation. Lastly, the mutant protein enhanced 
melanocytic and renal cell clonogenicity, migration and invasion, 
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consistent with a gain-of-function role in tumorigenesis. Our data 
provide insights into the link between SUMOylation, transcription 
and cancer. 

MITF encodes a member of the Myc supergene family of basic 
helix-loop-helix zipper transcription factors. It has a complex 
intron-exon structure, yielding protein products with different amino 
termini’. The M-isoform (expressed in the melanocyte lineage) of 
MITF regulates expression of a large set of genes promoting prolifera- 
tion and invasion. For instance, it controls the expression of MET? and 
CDKN2A/p16INK4A’*, which have key roles in melanoma develop- 
ment. The role of the A-isoform of MITF (expressed in kidney) in 
renal cell transformation has yet to be elucidated. However, MITF 
belongs to the same family of transcription factors (the MiT family) 
as the TFEC, TFE3 and TFEB genes; the two latter were identified as 
targets of somatic recurrent translocations in a RCC subtype found 
predominantly in children and young adults’. 

We sequenced the MITF gene in 62 patients with melanoma and 
RCC. Five patients exhibited a germline heterozygous missense sub- 
stitution, p.E318K (c.952G>A, in MITF isoform 4, NCBI accession 
NM_000248.3) (Fig. 1a). The frequency of this variant was signifi- 
cantly higher in patients with both melanoma and RCC (melanoma 
+ RCC) than in 1,659 population-based cancer-free controls 
(P= 1.3 X 10 “). Carriers of the p-E318K variant (Mi-E318K) exhibited 
a 14-fold higher risk than controls for developing melanoma + RCC 
(Table 1). To investigate the effect of Mi-E318K on susceptibility 
to melanoma alone, we genotyped 603 affected patients that had under- 
gone oncogenetic testing for melanoma susceptibility genes’ and 
were negative for CDKN2A and CDK4 mutations (Table 1). Mi-E318K 
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Figure 1 | Mi-E318K mutation impairs MITF SUMOylation. a, MITF 
sequence data of the c.952G>A, p.E318K germline substitution (Mi-E318K). 

b, HEK293 cells were co-transfected with empty pcDNA3 vector (EV) or pcDNA3 
encoding wild-type (Mi-WT) or mutant (K182R, Mi-E318K or K182R+E318K) 


frequency was significantly higher in these patients than in controls 
(P=7.8X 10°). The carriers of Mi-E318K in this group exhibited 
>4-fold higher melanoma risk than controls (Table 1). In addition, Mi- 
E318K co-segregated with melanoma in three melanoma-prone families 
that had DNA available in relatives of index cases carrying Mi-E318K 
(Supplementary Fig. 1a). Wealso found that Mi-E318K was more strongly 
associated with multiple occurrences of primary melanomas than with 
single occurrence, irrespective of family history (P = 0.02, Supplementary 
Table 1). To investigate the effect of Mi-E318K in RCC susceptibility, we 
genotyped 164 patients with RCC referred to oncogenetic clinics, who 
were enriched for rare histological subtypes and who were wild type for the 
known RCC-predisposing genes’. These patients also exhibited a higher 
frequency of Mi-E318K than controls (P = 0.008) (Table 1). After cor- 
recting for multiple testing, the association between Mi-E318K and mel- 
anoma + RCC, melanoma only or RCC only remained significant, 
although marginally so for the RCC-only group (P = 0.02). The three 
groups showed similar Mi-E318K allele frequencies (P = 0.10). When 
all 829 patients with melanoma and/or RCC were pooled, Mi-E318K 
frequency was significantly different from controls (P = 1.2 X 10°). 
The pooled group of Mi-E318K carriers had a greater than fivefold risk 
of developing melanoma, RCC or both cancers, as compared to controls 
(Table 1). The clinical and pathological features of the 27 Mi-E318K 
carriers are described in Supplementary Table 2. Principal component 
analysis (PCA) of single nucleotide polymorphism (SNP) data across the 
genome in more than 75% of cases and controls showed appropriate 
clustering of cases and controls after exclusion of a few subjects of 
non-European ancestry (Supplementary Fig. 1b). Association analysis 
of melanoma and/or RCC with Mi-E318K provided similar results when 
performed with or without adjusting for principal components 
(Supplementary Table 3). To assess whether Mi-E318K predisposed to 
the coexistence of melanoma and a second primary neoplasm other than 
RCC, we gathered data from European countries, but these data were 
insufficient for separate analyses of each type of cancer. Overall, we con- 
cluded from our findings that Mi-E318K isa rare substitution that confers 
an increased risk for developing melanoma and/or RCC. However, given 
the selection of patients who were part of case series undergoing genetic 
testing, further investigation of Mi-E318K in large series of unselected 
sporadic renal cancer and melanoma cases is merited. 


Table 1 | Frequency of the germline Mi-E318K substitution 
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Myc-tagged MITF and either with an empty vector or His-SUMO1 pSG5 vectors. 
Western blot analysis with antibodies to MITF, SUMO1 and ERK2 used as a 
loading control. c, Western blot analysis with antibodies to MITF and ERK2 of 
protein extracts before (Input) and after Ni-NTA affinity purification. 


Previous studies have shown that MITF is expressed in a majority of 
melanomas and in a subset of kidney tumours’. Accordingly, our 
immunohistochemical analyses showed that MITF was expressed in 
both melanomas from patients carrying the germline mutation Mi- 
E318K (n= 8) and in melanomas from wild-type patients (n = 8) 
(Supplementary Fig. 2a, b). In addition, two out of seven RCCs from 
Mi-E318K carriers were positive for MITF (Supplementary Fig. 2c, d), 
suggesting that Mi-E318K might have a role in renal transformation. 
None of the six wild-type MITF RCC samples analysed in this study 
showed MITF labelling. 

Mi-E318K occurs at a conserved position in MITF within a consensus 
motif (IKQE) that matches perfectly with the consensus sequence 
VEXE for covalent binding of SUMO” (Supplementary Fig. 3a, b). 
Because SUMOylation is critically dependent on the acidic residue at 
+2 (E) of the acceptor lysine (K), we tested whether Mi-E318K affects 
MITF SUMOylation. Co-expression of histidine-tagged (His)-SUMO1 
with wild-type MITF and western blot analysis revealed a 120-kDa band 
anda doublet above 95 kDa, in addition to the 55-65 kDa native doublet 
(Fig. 1b). The p.K182R mutation introduced within the second MITF 
SUMOrylation site led to complete disappearance of the 120-kDa MITF 
form, but had little effect on the 95-kDa doublet. When glutamic acid 
residue 318 was replaced by a lysine, mimicking the germline mutation, 
we observed a strong decrease in the levels of all the SUMO- 
modified forms of MITF. Lastly, no SUMOylated forms were seen with 
the double-mutant [p.K182R;p.E318K]. Similar observations were 
made with co-expression of haemagglutinin-tagged (HA)-SUMO2 
(Supplementary Fig. 4a), demonstrating that Mi-E318K affected both 
SUMO] and SUMO2 modifications. Western blot analysis with anti- 
MITF antibody, after affinity purification of His-SUMOl1-containing 
proteins on Ni-NTA columns, confirmed that the high molecular 
weight bands were indeed SUMO-modified MITF proteins and that 
Mi-E318K markedly reduced all the MITF SUMOylated forms 
(Fig. 1c). Altogether, these results demonstrated that codon 316 was 
the major SUMO acceptor site in MITF and that Mi-E318K severely 
impaired SUMO conjugation to MITF. 

SUMOylation has been shown to orchestrate a variety of cellular 
processes, in part through the control of nucleo-cytoplasmic signal 
transduction. However, both Mi-E318K and wild type were detected 


Subjects Number of non-carriers Number of carrierst Total Frequency of p.E318K FET P valuet OR (95%Cl)* 
Controls 1,649 10 1,659 0.003 - Reference 

Patients with melanoma and/or RCC 802 27 8298 0.016 1.2 x 10° 5.55 (2.59-12.91) 
Melanoma + RCC 57 5 62 0.040|| 1.3 x 10-4 14.46 (3.74-48.04) 
Melanoma only 586 17 603 0.014 || 7.8 x 107° 4.78 (2.05-11.75) 
RCC only 159 5 164 0.015|| 0.008 5.19 (1.37-16.87) 


*OR (95% Cl) is the odds ratio (with 95% confidence interval) associated with the Mi-E318K carrier status. 
+ FET, Fisher's exact test for the difference in Mi-E318K allele frequency between each group of patients and controls. 
fAll carriers are heterozygotes for the Mi-E318K substitution. Clinical and pathological features of the 27 carriers are described in Supplementary Table 2. 


§ A full description of the 829 patients is given in Methods. 


|| The Fisher's exact test showed no significant differences (P = 0.10) in allele frequency among the three groups of patients (melanoma + RCC, melanoma only, RCC only). 
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by immunofluorescence in the nucleus of transfected melanoma cells. 
This indicated that Mi-E318K did not affect MITF nuclear localization 
(Supplementary Fig. 4b, c). 

Previous reports indicated that SUMOylation of MITF repressed its 
transcriptional activity, particularly when the targeted promoter con- 
tained multiple E-boxes’®. Indeed, on a synthetic reporter gene com- 
prising three E-boxes, Mi-E318K showed two- to threefold greater 
transcriptional activity than that of wild type (Fig. 2a). The double 
MITF mutant, [p.E318K;p.K182R], was more effective than Mi- 
E318K, and the single p.K182R mutant exhibited activity comparable 
to that of wild type. We next tested whether Mi-E318K affected MITF 
activity on physiological target promoters, such as MET and HIFIA, 
two MITF-regulated genes involved in cell survival and in melanocyte 
and kidney tumorigenesis. We also tested CDKN2A and TYR promoters, 
which are involved in melanocyte proliferation and differentiation 
processes, respectively. Wild type and mutant Mi-E318K showed 
similar activity on the MET (Fig. 2b), TYR and CDKN2A promoters 
(Supplementary Fig. 5), but Mi-E318K more efficiently activated the 
HIF1A promoter than wild type (Fig. 2c). This observation suggests 
that Mi-E318K alters MITF transcriptional activity on a subset of its 
target genes. To determine the global transcriptional effects of Mi- 
E318K, we performed pan-genomic expression profiling of A375 mel- 
anoma and RCC4 cells infected with adenoviruses that encoded either 
wild-type MITF or Mi-E318K (Supplementary Fig. 6a, b, c, d). Analysis 
of the genes regulated differentially by wild type and Mi-E318K in 
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Figure 2 | Mi-E318K increases MITF binding and transcriptional 
regulation of a subset of its target genes. a-c, Reporter plasmids containing 
the luciferase gene under the control of different MITF target promoters were 
co-expressed with empty pcDNA3 (EV) or pcDNA3 encoding wild-type (Mi- 
WT) or mutant (K182R, Mi-E318K, K182R+E318K) MITF. a, HEK293 cells 
were transfected with a 3M-box reporter. b, c, 501mel human melanoma cells 
were transfected with the c-MET promoter reporter (b) or the HIF1A promoter 
reporter (c). The results were expressed as the fold stimulation over basal 
luciferase activity. Data represent the mean + s.d. of three independent 
experiments. Asterisk indicates a statistically significant difference (Student’s 
t-test, P= 0.05). d, MITF-occupied loci in genomic DNA extracted from 
501mel melanoma cells expressing Mi- WT or Mi-E318K. MITF tag density was 
compared in the region of +5 kb around the MITF-occupied loci. e, University 
of California Santa Cruz (UCSC) view of MITF occupancy of HIF1A and 
ABCBS loci. The blue bar with arrowheads below each panel indicates the 
transcription start site; the E-box nucleotide sequence is shown under peaks. 
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A375 cells failed to identify a specific Mi-E318K signature. However, 
in RCC4 cells, we identified a Mi-E318K signature composed of 32 
genes (Supplementary Table 4). On the basis of the bioinformatic 
analysis using Ingenuity Pathways Analysis (IPA) software, these genes 
were associated with cell growth, proliferation and inflammation 
(Supplementary Fig. 7). Among the genes downregulated by Mi- 
E318K compared to wild type, IRAK2 (ref. 11), EGR1 (ref. 12) and 
IL6 (ref. 13) have tumour suppressor functions. The upregulated genes 
included CCR7, a HIF-1a target gene’* and activator of the NF«B 
pathway’; ABCB5, one of the most well-documented markers for 
melanoma initiating cells’; GADD45G, a member of the GADD45 
family of stress sensors, which connects NFKB to the MAP kinase 
pathway’; and TRIM63, which was one of the top five upregulated 
genes and was the major gene found to be differentially expressed in 
patients with TFE3/TFEB translocation RCC as compared to other 
subtypes of RCC*. These experiments confirmed the different tran- 
scriptional potentials of wild type and Mi-E318K, particularly on genes 
involved in proliferation and inflammation. 

To investigate whether SUMOylation influenced genome-wide 
MITF occupancy of its target sites, we performed chromatin immuno- 
precipitation coupled to high-throughput sequencing (ChIP-seq) 
experiments in 501mel human melanoma cell lines expressing wild 
type or Mi-E318K. Data analysis shows that 22,157 sites are occupied 
by Mi-E318K, whereas only 9,107 were detected for wild type, the 
difference being the presence of a large number of weakly occupied 
sites in the Mi-E318K data set. Comparative analysis of the ChIP-seq 
data sets using read density matrix clustering identified sites that are 
occupied by both wild type and Mi-E318K (Fig. 2d, sets B and E), 
whereas some promoter sites are uniquely or preferentially occupied 
by Mi-E318K (Fig. 2d, sets A and D; Supplementary Table 5). For 
example, a higher occupancy of the HIFIA promoter by Mi-E318K 
compared to wild type is observed (Fig. 2e). Mi-E318K also showed 
increased binding to the ABCBS5 locus (Fig. 2e), the expression of 
which is increased in RCC4-expressing Mi-E318K. Of note, Mi- 
E318K binds more efficiently than wild type to the HMOX1 promoter 
(Supplementary Table 5), a gene involved in both kidney cancer'* and 
melanoma cell growth’’. Although the HIFIA promoter was better 
occupied and activated (in a gene promoter assay) by Mi-E318K 
than by wild type, we observed equal amounts of HIFIA messenger 
RNA in Mi-E318K- or wild-type-infected A375 and RCC4 cells. 
HIF1A transcripts are subjected to post-transcriptional regulation that 
controls their stability”®; this may mask the transcriptional effect of Mi- 
E318K. Alternatively, the cellular context or the chromatin landscape 
in RCC4 or A375 cells may not permit increased transcription. Indeed, 
this was observed for MITF target genes such as TYR, TYRP1 and DCT, 
which were not upregulated by MITF overexpression in A375 cells. 
Taken together, our data indicate that the naturally occurring Mi- 
E318K severely impaired MITF SUMOylation and showed higher 
global transcriptional activity, in agreement with the currently accepted 
model of SUMO-mediated transcriptional repression*’. The global 
increase in Mi-E318K-occupied loci coupled with the existence of sites 
exclusively bound by the mutant protein indicate that SUMOylation 
regulates the repertoire of MITF target genes. SUMOylation-deficient 
Mi-E318K protein may therefore result in the regulation of distinct sets 
of genes, hence leading to gain-of-function properties. 

To evaluate the tumorigenic potential of Mi-E318K, we tested its 
effect on migration, invasion and colony formation of stable melanoma 
cells (501mel) expressing wild-type MITF or Mi-E318K (Fig. 3a). We 
observed increased migration (Fig. 3b) and invasion (Fig. 3c) in two 
different Mi-E318K clones compared to wild-type clones. Similar 
results were obtained in VHL-deficient RCC4 cells (Fig. 3d, e) infected 
with adenoviruses encoding wild type or Mi-E318K. In contrast, Mi- 
E318K caused barely significant effects in A375 cells (used also for 
expression profiling) (Supplementary Fig. 8). Lastly, in a colony-form- 
ing assay with immortalized murine melanocytes, Melan-a (Fig. 3f), 
and RCC4 cells (Fig. 3g), the number of colonies after transfection of 
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Figure 3 | Mi-E318K-enhanced migration, 
invasion and clonogenicity of melanoma and 
renal cancer cells. a, Western blot analysis with 
antibodies to MITF or ERK2 of 501mel cells stably 
transfected with empty vector (EV), or vector 

7" encoding Mi-WT or Mi-E318K (clones 1 and 2). 
y MW, molecular weight. b, c, Migration (b) and 
invasion (c) assays with 501mel melanoma cells 
described earlier. d, e, Migration and invasion 
(e) assays in RCC4 cells transduced with empty 
adenovirus (EV) or adenoviruses encoding either 
Mi-WT or Mi-E318K. b-e, Photographs above 
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Mi-E318K was enhanced compared to wild-type-transfected or control 
cells. Overall, these data indicated that Mi-E318K was more potent than 
wild type for promoting invasive and tumorigenic behaviours in mel- 
anoma and RCC cells. However, Mi-E318K did not significantly stimu- 
late melanoma or RCC cell growth (data not shown). These features are 
reminiscent of melanoma cell populations with increased invasive and 
division potential but with a slow growth rate, which are considered to 
be melanoma-initiating cells. Thus, Mi-E318K might favour a pheno- 
typic switch of melanoma cells towards a tumour-initiating cell pheno- 
type, possibly in synergy with hypoxia”’. 

Indeed, both the epidermis (where melanocytes are located) and the 
inner renal medulla are physiologically hypoxic tissues***°. Hypoxia is 
capable of rapidly inducing HIF1A to initiate a cell-survival response”®. 
Of note, several genes identified by expression profiling or ChIP-seq 
(HIFLA, CCR7, HMOX1) function in the hypoxia pathway. Both ultra- 
violet exposure and hypoxia generate radical oxygen species (ROS). In 
turn, oxidative stress activates inflammatory pathways that can lead to 
cellular transformation, proliferation and stem cell survival, among 
other features”. Our IPA analysis associated the Mi-E318K signature 
with inflammation, cell proliferation and cancer. Various environ- 
mental stresses, including hypoxia and ROS, were previously shown 
to induce global protein SUMOylation”*. In this context, Mi-E318K, 
which prevents MITF SUMOylation, could impair the adaptation of 
cells to stress and initiate tumour formation. Our study has wide impli- 
cations for understanding the role of MITF and its SUMOylation in the 
physiology and tumorigenesis of melanocytes and kidney cells. 

We have identified a rare oncogenic germline substitution, Mi- 
E318K, that predisposes to both melanoma and RCC. Interestingly, 
MITF loss-of-function mutations are responsible for an inherited dis- 
order in neural crest cell development, the type 2a Waardenburg syn- 
drome’. This is reminiscent of the RET oncogene, in which activating 
germline mutations predispose to medullary thyroid carcinoma, but 
loss-of-function germline mutations predispose to Hirschsprung’s dis- 
ease, a congenital absence of enteric neurons in the gastrointestinal 
tract”. Our data indicate that mutation screening in genetically 
enriched patients is a powerful strategy to identify rare genetic vari- 
ation that confers a moderate risk of cancer. Lastly, it highlights the 
relevance of hereditary tumour models for shedding light on cell- 
growth-related signals and identifying cancer driver genes. 


METHODS SUMMARY 


Participants were included with the approval of an institutional review board. 
Sanger sequencing of MITF isoforms was performed in 62 melanoma + RCC 
cases. Mutations numbering referred to NCBI accession NM_000248.3. Wild-type 
or Mi-E318K genotyping was performed in 1,659 controls and 829 cases by 
TaqMan; positive samples were checked by Sanger sequencing. Genome-wide 
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each bar graph show a representative field of the 
underside of filter after cell migration or invasion; 
magnification, X 200. f, g, Mi-E318K increased the 
number of colonies formed with Melan-a or RCC4 
cells, respectively. Photographs of the colonies were 
taken at 2 weeks. b-g, Bars show the mean + s.d. of 
three replicate assays. b-g, Student’s t-test 
compared Mi-WT to Mi-E318K. *P < 0.05, 

**P <(0.01 and ***P< 0.001. 
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SNP data were genotyped by the Centre National de Génotypage (CNG, CEA) 
using Illumina technology. PCA was applied to the SNP data using EIGENSTRAT 
software. Immunohistochemistry was performed on melanoma and kidney fixed 
samples with anti-MITF antibody (Abcam, clone D5). Mutations in the two 
SUMO] fixation sites, p.K182R and p.E318K, were generated with the 
QuickChange method in wild-type MITF-M and MITF-A isoforms and verified 
by sequencing. Plasmids were transfected into HEK293, 501mel, Melan-a and 
RCC4 cells with FuGENE 6 reagent (Roche Applied Science). Western blotting 
and immunofluorescence were performed with antibodies to MITF (Abcam, clone 
C5), HA-tag, SUMO1 or ERK2 (Santa Cruz Biotechnology). Reporter assays were 
performed by transient transfection in triplicate with lipofectamine (Invitrogen), 
and pCMV fGal was included to control transfection efficiency. Gene expression 
microarray assays were performed with Agilent technology by the Institut de 
Cancérologie Gustave Roussy (IGR) genomics platform. ChIP-seq experiments 
were performed by the Institut de Génétique et de Biologie Moléculaire et 
Cellulaire (IGBMC) microarray and sequencing platforms. Colony forming assays 
were performed after 14 days of culture; cells were fixed, stained with 0.4% crystal 
violet, and photographed. Statistical analyses for comparison of allele frequencies 
between cases and controls were performed with Fisher’s exact test. All computa- 
tions were performed with Stata software, version 11. For functional assays, stat- 
istical significance was evaluated with the Student’s t-test. Results were considered 
significant when the Student’s t-test, *P value = 0.05, **P = 0.01, ***P = 0.001. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Study participants. Patients with melanoma and RCC. Sixty-two patients that 
had developed both melanoma and RCC were included in the MELARISK col- 
lection at the Institut de Cancérologie Gustave Roussy (IGR) and other French 
hospitals, 71% were males. None of these patients carried a germline CDKN2A, 
CDK4 or VHL pathogenic mutation. 

Patients with melanoma only. The patients with melanoma were enrolled 
through a nationwide network of French dermatology and oncogenetic clinics that 
constituted the Familial Melanoma Project and the MELARISK collection. The 
present study investigated a total of 371 index cases from independent pedigrees 
with a family history of melanoma (at least two melanoma cases) and/or pancreatic 
cancer, and 232 sporadic cases diagnosed with multiple primary malignant mel- 
anomas. A subset of the families (N = 34) included a melanoma patient and 
relatives with pancreatic cancer. These were part of the CDKN2A mutation testing 
procedure; carriers of the CDKN2A mutation have high risk of developing pan- 
creatic cancer*'. All cases had confirmed diagnoses of malignant melanoma 
through medical records, review of pathological material, and/or pathological 
reports. For all these patients, mutation screening for CDKN2A (exon 1f, 1a, 2 
and 3) and CDK4 (exon 2) had been undertaken for molecular diagnosis purposes. 
None of these patients carried a known CDKN2A or CDK4 pathogenic mutation. 

Patients with RCC only. Overall, 164 patients included in this study had 
developed RCC only. Those patients had been recruited through French urology 
and oncogenetic clinics, within the French National Cancer Institute’s ‘Inherited 
Predisposition to Kidney Cancer’ network. They were considered ‘genetically 
enriched’ based on the following criteria: familial aggregation, young age of onset 
and rare histological subtypes. Out of the total of 164 patients, 79 had sporadic 
ccRCC, 54 had papillary renal cell carcinoma (PRCC; with 6 type I PRCC, 19 type 
II PRCC, 1 mixed type PRCC and 28 unknown histological subtypes of PRCC), 1 
had a mixed RCC phenotypes (that is, papillary and ccRCC), 5 had juvenile RCC 
and 25 had RCC of unknown histological subtype. None of the 79 patients with 
ccRCC carried a VHL pathogenic germline mutation; none of the 19 patients with 
PRCCII carried a FH germline mutation; and none of the 6 patients with PRCCI 
carried a MET germline mutation. 

Control subjects. Controls were supplemented by a population-based sample of 
1,659 French subjects that had participated in the Supplementation in Vitamins 
and Mineral Antioxidants (SU.VILMAX) study. We confirmed that these sub- 
jects had not developed any cancer at the time of the present study. 

The study was approved by an institutional review board (CCPPRB no. 01-09- 
05, Paris Necker) for the MELARISK collection and by the Ethical Committee of 
Le Kremlin-Bicétre University Hospital, for the kidney cancer collection. It was 
conducted according to the Declaration of Helsinki Principles. All participating 
subjects signed informed consent and provided blood samples. After informed 
consent had been obtained from the subjects, DNA was extracted from peripheral 
blood lymphocytes with the QlAamp DNA Blood mini kit (QIAGEN), according 
to manufacturer guidelines. 

MITF gene sequencing. Sanger sequencing of MITF coding sequence, the intron- 
exon boundaries and the 8 alternative promoters was performed in DNA extracted 
from the blood of 62 patients affected with melanoma + RCC (primers available 
upon request) on a 3730 DNA Analyser (Applied Biosystems; ABI). Nucleotide 
number refers to the wild-type cDNA sequence of MITF (NM_000248.3) as 
reported in GenBank. 

Genotyping of Mi-E318K. Genotyping was carried out with Taqman according 
to the manufacturer recommendations. Primers and probes were supplied in the 
Assay-by-Design by Applied Biosystems (ABI). PCR reactions were performed 
with 10 ng genomic DNA and 0.2 pmol!’ TaqMan MGB probes. Probe 5’-VIC- 
ATCAAGCAAGAACCCG-3’ was designed to match the wild-type allele, and 
probe 5'-6-FAM-CAAGCAAAAACCCG-3’ was designed to match the mutant 
allele (underlined bold indicates the nucleotide at the mutation site), PCR thermo- 
cycling was performed on ABI thermocyclers, as follows: 95°C for 15 min; 30 
cycles of 95°C for 15s and 60°C for 1 min. Assays were carried out in 96-well 
plates that included a negative control (with no DNA) and positive controls (DNA 
from subjects with Mi-E318K). Plates were read on a 7900HT Fast Real-Time PCR 
System (ABI) with Sequence Detection Software (ABI). Carriers of Mi-E318K 
were confirmed by Sanger sequencing with the above protocol and primers for 
MITF exon 9. 

PCA of genome-wide SNP data. To verify that the association of Mi-E318K with 
melanoma and/or RCC was not influenced by population stratification, we carried 
out PCA of SNP data across the genome in cases and controls that had been 
genome-wide genotyped and satisfied stringent quality control criteria. A total 
of 1,628 controls and 569 cases affected with melanoma, RCC or both cancers were 
genotyped by the Centre National de Génotypage (CNG) using Illumina 
HumanHap300 Beadchip version 2 duo array for controls and Illumina 
Humancnv370k and Human660W-Quad arrays for cases. Samples were excluded 


for any of the following reasons: (1) a call rate of less than 97% of the total number 
of SNPs on the chip (222 samples); (2) sex as ascertained by genotyping not 
matching reported sex (10 samples); (3) heterozygosity on autosomes departing 
from the estimated expected value (7 samples); (4) relatedness with another sam- 
ple (4 samples). This resulted in exclusion of 243 samples. PCA was then applied to 
1,954 subjects that passed quality control using EIGENSTRAT software’. To 
identify individuals of non-European ethnicity, SNPs were thinned to reduce 
linkage disequilibrium and combined with the HapMap data of wide-ranging 
ethnicity. The first two principal components (PCs) clearly separated the 
HapMap data into distinct clusters according to ethnicity and identified 29 study 
samples of non-European ancestry who were excluded. The remaining 1,925 
European ethnicity samples (1,389 controls, 536 cases) were analysed similarly 
without the Hapmap data. Plotting the first two principal components showed 
appropriate clustering of cases and controls (Supplementary Fig. 1). 

MITF immunohistochemistry. Briefly, 4-11m tissue sections were cut from par- 
affin-embedded blocks, deparaffinized and rehydrated. RCC (patient ID numbers 
10276, 24976, 11473, 21309, 21939, 25220, 26534; all in Supplementary Table 2) 
immunohistochemistry was performed with a three-step, avidin—biotin-peroxidase 
method. Staining was performed in an automated Dako Autostainer. Heat-mediated 
antigen retrieval was performed in 0.1 moll~’ pH 6.0 citrate, and samples were 
heated in a water bath for 30 min in a Dakolink processor (Dako). Monoclonal 
mouse anti-human MITF (Clone D5, Dako, 1/100) was incubated with the samples 
for 30 min at room temperature (22 °C). SSM immunohistochemistry (patient ID 
numbers 19525, 24976, 10254, 15168, 15012, 21000, 22112, 27708 all in 
Supplementary Table 2 except number 27708 in Supplementary Fig. 1) was per- 
formed with a three-step avidin—biotin-alkaline-phosphatase method. Staining was 
performed in a Roche Diagnostics Benchmark XT automated stainer. Heat- 
mediated antigen retrieval was performed in the automat in cell conditioning solu- 
tion 1 (Roche Diagnostics) for 1h (Dako). Monoclonal mouse anti- Human MITF 
(Clone D5, Roche Diagnostics, pre-diluted) was incubated with samples for 16 min 
at room temperature. All slides were then counterstained with haematoxylin. 
Appropriate negative and positive controls were prepared in parallel. 

Functional assays. Plasmids. The pcDNA3-Mi construct, which carried the 
M-MITF isoform and the 3M vector have been described previously**. The 
A-MITF isoform was a gift from H. Arnheiter (NIH). The MITF mutations, 
p-K182R and/or Mi-E318K, were generated with the QuickChange method 
(Stratagene) with the following sense primers and their reverse complements: 
Mi-K182R 5'-CTTCCCAACATAAGAAGGGAGCTCACAGC-3';  MI-E318K 
5'-GGATCATCAAGCAAAAACCAGTTCTTGAG-3’. Mutations were con- 
firmed by DNA sequencing. His-SUMO1 and HA-SUMO2 were a gift from A. 
Dejean and have been described elsewhere”. 

Cell cultures, transfections, and immunoblots. Human 501mel and A375 mel- 
anoma cells, human RCC4 cells and HEK293 cells were grown in DMEM supple- 
mented with 7% FBS. Melan-a cells were cultured in RPMI 1640, 7% FCS, 200 nM 
TPA and 200 pM cholera toxin. All the cells were maintained at 37 °C in a humidi- 
fied atmosphere containing 5% CO . HEK293 cells were cultured in 6-well dishes 
(10* cells per well) and transfected with the above-mentioned plasmids (2 Lg of 
total DNA per well) and FuGENE 6 (Roche Applied Science). Forty-eight hours 
later, the cells were washed with PBS and lysed at 95°C in 1X loading buffer 
(41.6 mM Tris, pH 6.8, 1.5% SDS, 6.7% glycerol). Proteins were resolved by elec- 
trophoresis in 10% SDS-polyacrylamide gels and transferred to PVDF mem- 
branes. Proteins were detected with ECL (Amersham) and antibodies to MITF 
(Abcam), HA-tag (Abcam), SUMO] (Santa Cruz Biotechnology) or ERK2 (Santa 
Cruz Biotechnology). 

Reporter assays. Human 501mel melanoma cells and HEK293 cells were seeded 
in 24-well dishes (25 X 10° cells per well). Subsequently, cells were transiently 
transfected with 0.3 1g reporter plasmid (3M, pHIFIA, pMET), 0.05 pg of 
MITF-encoding plasmids or empty, control pcDNA3 (EV), 2 ul of lipofectamine 
reagent (Invitrogen) and 0.05 jig of pCMVfGal for controlling the variability in 
transfection efficiency. Cells were lysed 48h later and assayed for luciferase and 
B-galactosidase activities. Transfections were repeated at least three times. 

Immunofluorescence. HEK293 cells were seeded on glass coverslips (100 X 10° 
cells) in 6-well dishes, and transfected with 3 1g of the different MITF mutants and 
10 yl lipofectamine. Forty-eight hours later, cells were fixed for 10 min with 4% 
paraformaldehyde, permeabilized for 2 min with 0.1% Triton X-100/1% bovine 
serum albumin (BSA), and treated for 2 min with NH,Cl 50 mM. Then, samples 
were washed three times in PBS and stained for 1 h with monoclonal anti-MITF 
antibody (Abcam, clone D5) in 1% BSA/PBS. Next, samples were washed three 
times in PBS for 5 min each and then stained secondarily for 1h with Alexa-594- 
conjugated goat-anti-mouse antibody (Molecular probes) in 1% BSA. Cells were 
counterstained with 4,6-diamidino-2-phenylindole (DAPI), mounted with 
Fluoromount-G (Southern Biotech), and examined with a Zeiss Axiophot micro- 
scope equipped with epifluorescent illumination. 
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Colony forming assay. Immortalized Melan-a mouse melanocytes, A375 
human melanoma cells and RCC4 human cells (8 X 10* cells per well) were 
transfected with wild type or Mi-E318K (31g total DNA per well and 10% 
pBABE-puro) with the FuGENE 6 Transfection reagent (Roche Applied 
Science). Puromycin (1 Lg ml ') was added to media at 48h post-transfection. 
Fourteen days later, the cells were fixed and stained with 0.4% crystal violet, and 
the plates were photographed. 

Establishing stable 501mel cells that expressed tagged MITF. 501mel cells were 
transfected with FuGENE 6 reagent, a vector encoding puromycin resistance, and 
a pCMV vector that was either an empty, control vector or a vector encoding 
3X HA-tagged MITF (wild type or Mi-E318K). Transfected cells were selected 
with puromycin; the expression of MITF was verified by western blot analysis 
with anti-MITF (Abcam, C5) or anti-HA (Roche, 12CA5) antibodies. 

Migration and invasion assays. Migration (on uncoated filters) and invasion (on 
coated filters with matrigel) were investigated in a Boyden chamber system that 
comprised 24-well plates and 8-um pore filter inserts (BD Bioscience). Stable 
501mel melanoma cells (2 X 10° cells), A375 melanoma cells (1.5 X 10° cells) 
and RCC4 renal cancer cells (5 X 10* cells) were infected with control vector or 
adenovirus that encoded wild type or Mi-E318K at a multiplicity of infection 
(MOI) of 20 for 24h, were then resuspended in serum-free DMEM and seeded 
on the upper chamber inserts. DMEM with 7% FCS was placed into the lower 
chamber. Twenty-four hours (501mel and RCC4) or 6h (A375 cells) later, cells 
adherent to the underside of the filters were fixed with 4% PFA and stained with 
0.4% crystal violet. Adherent cells were counted in five random fields at 200 
magnification. Results represented the average of duplicate samples from three 
independent experiments. 

Gene expression arrays. Three replicates were performed for both RCC4 and 
A375 cells. mRNA was isolated with Trizol (Invitrogen) from A375 melanoma 
cells and from RCC4 cells infected with either control, wild type or Mi-E318K, 
according to standard procedures. Briefly, probes were synthesized from 500 ng of 
total RNA in two steps, according to the manufacturer’s instructions. The two 
samples to be compared were labelled separately with different fluorescent dyes, 
cyanine-3 (Cy3) and cyanine-5 (Cy5). For each sample, 1 ug of purified CRNA 
labelled in cy5 was mixed with the same amount of cRNA labelled in cy3. Label 
incorporation was checked on a NanoDrop spectrophotometer. Hybridizations 
were performed with a dye-swap strategy on whole-human-genome dual colour 
8 X 60K oligonucleotide microarrays (design 028004; Agilent Technologies). 
Feature extraction software provided by Agilent (version 10.7.3.1) was used to 
quantify the intensity of fluorescent images and to apply a linear/lowess normal- 
ization to correct for artefacts caused by nonlinear rates of dye incorporation and 
inconsistent relative fluorescence intensities between some green and red dyes. All 
data were imported into Resolver software (Rosetta Biosoftware) for database 
management, quality control, computational re- 

combination of dye-swaps and statistical analysis. Mi-E318K specific signature 
was generated using the following parameters: intensity >50; fold change >1.5; P 
value <0.05. Functional analysis was carried out with the Ingenuity Pathway 
Analysis (Ingenuity System, http://www.ingenuity.com). 
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ChIP-seq. ChIP-seq was performed as previously described”. Briefly, chro- 
matin was prepared from native 501mel cells or cells that stably expressed 
HA-tagged wild type or Mi-E318K. The isolated chromatin was HA-immu- 
noprecipitated and sequenced with an Illumina GAIIx sequencer. The raw 
data were analysed with the Illumina Eland pipeline V1.6 programme. 
Peak detection was performed with MACS software (http://liulab.dfci-har- 
vard.edu/MACS/), and the peaks were annotated with GPAT software (http:// 
bips.u-strasbg.fr/GPAT/Gpat_home.html). Peak annotations were performed 
with a window that included +20 kb from the coordinates at the beginning and 
end of RefSeq transcripts. The total number of reads for the wild-type data set was 
1.7 fold higher than for the Mi-E318K data set. To facilitate quantitative compar- 
isons, an appropriate number of reads was randomly removed from the wild-type 
data set to match the number present in the Mi-E318K data set. Subsequently, 
a quantitative comparison of the ChIP-seq data sets was performed with 
seqMINER”. Clustering was performed by counting the number of tags in a 36- 
bp sliding window for each ChIP-seq data set based on the coordinates of wild-type 
or Mi-E318K binding sites or on a reference sequence (RefSeq TSS). Only peaks 
with =10 reads were used for comparison. A matrix of binding sites and densities 
was generated and subjected to K-means clustering with the seqMINER pro- 
gramme. 

Statistical analysis. Fisher's exact test was used to compare Mi-E318K allele 
frequencies between cases and controls and among different groups of cases. 
The odds ratios and confidence intervals associated with Mi-E318K carrier status 
were estimated by exact logistic regression. Logistic regression was also used to test 
for the effect of Mi-E318K on melanoma and/or RCC with and without adjusting 
for the first ten principal components estimated from genome-wide SNP data in 
patients and controls who were genome-wide genotyped and passed quality con- 
trol. All computations were performed with Stata software, version 11 (StataCorp 
LP). For functional assays, results were considered significant when the Student’s 
t-test, *P value was <0.05, **P < 0.01, ***P = 0.001. 
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Changes in plant community composition lag behind 
climate warming in lowland forests 


Romain Bertrand’, Jonathan Lenoir’, Christian Piedallu'’, Gabriela Riofrio-Dillon!’, Patrice de Ruffray*, Claude Vidal®, 


Jean-Claude Pierrat'* & Jean-Claude Gégout!* 


Climate change is driving latitudinal and altitudinal shifts in species 
distribution worldwide'”’, leading to novel species assemblages**. 
Lags between these biotic responses and contemporary climate 
changes have been reported for plants and animals’. Theoretically, 
the magnitude of these lags should be greatest in lowland areas, 
where the velocity of climate change is expected to be much greater 
than that in highland areas®. We compared temperature trends to 
temperatures reconstructed from plant assemblages (observed in 
76,634 surveys) over a 44-year period in France (1965-2008). Here 
we report that forest plant communities had responded to 0.54 °C of 
the effective increase of 1.07°C in highland areas (500-2,600 m 
above sea level), while they had responded to only 0.02 °C of the 
1.11 °C warming trend in lowland areas. There was a larger temper- 
ature lag (by 3.1 times) between the climate and plant community 
composition in lowland forests than in highland forests. The 
explanation of such disparity lies in the following properties of 
lowland, as compared to highland, forests: the higher proportion 
of species with greater ability for local persistence as the climate 
warms’, the reduced opportunity for short-distance escapes*”, 
and the greater habitat fragmentation. Although mountains are 
currently considered to be among the ecosystems most threatened 
by climate change (owing to mountaintop extinction), the current 
inertia of plant communities in lowland forests should also be noted, 
as it could lead to lowland biotic attrition”®. 

Despite the general pattern of poleward and upward range shifts'”, 
the timing, magnitude and direction of these biotic responses vary 
greatly among species and geographic conditions’**. This variation 
can prevent biotic communities from migrating as intact units in res- 
ponse to climate change’. Since the last interglacial, biotic communities 
have been reshuffled, leading to the formation of novel species assem- 
blages linked to new climatic conditions**. Recent changes supporting 
this pattern have occurred within plant and animal communities in 
response to contemporary climate warming’*'*. Even though these 
biotic signals are coherent with climate change’, many studies suggest 
a lag between the magnitude of climate warming and the reshuffling 
observed in biotic communities*'*’”-'°. However, comparisons between 
the extent of the observed biotic signals and that expected based on the 
intensity of climate warming are still scarce, especially for lowland areas, 
which offer no short-distance escapes for species facing climate 
change*”. If biotic communities respond synchronously without lagging 
behind climate warming, current species compositions would reflect 
contemporary climatic conditions within both lowland and highland 
areas. However, the greater velocity of climate change in lowland areas 
compared to highland areas® suggests that reshuffling in biotic com- 
munities is more likely to lag behind climate warming in lowland areas 
compared to highland areas. However, this theory remains unverified. 

Climate change in France has been characterized by increases in 
mean annual temperatures of far greater magnitude than the worldwide 


average, reaching up to about +1.05 °C on average during 1987-2008 
(Supplementary Fig. 2), compared to about +0.35°C for the worldwide 
average”. Assuming that species assemblages can be used to infer 
climatic conditions at a given location and time (an approach com- 
monly used by paleoclimatologists”', who refer to the relationship as a 
transfer function), we report a study monitoring plant communities 
across French forests over 44 years, 1965-2008 (totalling 76,634 
surveys; Supplementary Figs 1, 3 and 4). Here we compare temperature 
trends reconstructed from observed reshuffling of plant assemblages 
(‘floristically reconstructed’ temperatures based on 760 species; 
Supplementary Table 1) with temperature trends reconstructed from 
instrumental records and climate models (‘climatically reconstructed’ 
temperatures based on 237 stations; Supplementary Fig. 5) in lowland 
(<500 m a.s.l.) and highland forests (500-2,600 m a.s.l.) (Supplemen- 
tary Methods). Changes in floristically reconstructed temperatures 
involve reshuffling of plant community composition towards more 
heat- or cold-demanding species assemblages. Following the current 
climate warming, small differences between floristically and climatically 
reconstructed temperatures may indicate important reshuffling of 
plant community composition, close to the magnitude of climate 
change (that is, high recovery of the climate-flora equilibrium; 
Fig. 1). In contrast, large differences between floristically and climatically 
reconstructed temperatures may indicate that reshuffling in plant com- 
munities lags behind climate warming (that is, low recovery of the 
climate-flora equilibrium; Fig. 1). 

Before the contemporary climate warming event (1965-86; Sup- 
plementary Fig. 2), we found no significant differences between 
floristically and climatically reconstructed temperatures in lowland 
and highland forests (Table 1), suggesting close equilibrium between 
species composition in biotic assemblages and temperature conditions. 
In contrast, during the current climate warming period (1987-2008), we 
found significant differences between floristically and climatically recon- 
structed temperatures in lowland and highland forests (Table 1), indi- 
cating a lag in the response of herbaceous forest species to climate 
change. In lowland forests, we found no significant increase in florist- 
ically reconstructed temperatures between 1965-86 and 1987-2008 
(+0.02 °C on average), whereas climatically reconstructed temperatures 
increased by +1.11°C on average (Table 1a). In contrast, the floristically 
reconstructed temperatures of highland forests increased by +0.54 °C, 
with climate warming reaching + 1.07 °C over the same period (Table 
1b). This discrepancy indicates an important lag in the response of 
herbaceous forest species to climate change, being 3.1 times larger in 
lowland forests (temperature lag = 1.29 °C on average) than in highland 
forests (temperature lag = 0.42 °C on average; Table 1). In 2008, the 
most recently investigated year, this lag decreased to an average of 
0.28 °C in highland forests but was still 1.26 °C in lowland forests (Fig. 2). 

Our results suggest that the recovery of forest plant communities 
facing climate warming is weak in lowland forests but strong and fast 
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Figure 1 | Theoretical response of plant communities to climate warming. 
a, Floristically (green scale; FrT) and climatically (red; CrT) reconstructed 
temperature trends over time. The green scale (0-100%) describes different 
hypothetical floristically reconstructed temperature trends corresponding to 
increasing intensity of the plant community responses and leading to 
increasing recovery of the climate-flora equilibrium (measured as the actual 
change in FrT over time relative to the effective change in CrT over the same 
time period; see Supplementary Methods for complete formula). 

b, Temperature niche separation among 10 virtual species. The range of 
temperature requirements for each species is represented by a vertical line. 

c, Three illustrative cases of increasing recovery of the climate—flora 
equilibrium in plant community composition (based on the 10 virtual species in 


in highland forests, reaching +0.26 °C per decade (Fig. 2). This rate of 
increase in floristically reconstructed temperatures confirms the 
results of a previous study, which reported a similar trend of 
+0.25 °C per decade for herbaceous forest plants in the French moun- 
tains, despite the use of a completely different approach based on the 
comparison of species’ optimum elevation between 1905-85 and 
1986-2005"’. In a Mediterranean forest, the flora turnover observed 
over the last decade was half that expected from temperature 
changes”, whereas we found that reshuffling of forest plant com- 
munities recovered 0% and 60.7% (up to 68.2% in 2008) of the effective 
temperature increase between 1965-86 and 1987-2008 in lowland and 
highland areas, respectively. In highland forests, plant communities 
even completely recovered the effective change in temperature (that is, 
no significant differences between floristically and climatically recon- 
structed temperatures) for the years 1991, 1999, 2002, 2004 and 2005 
(Fig. 2). Thus, lowland areas seem to be one of the least reactive 
terrestrial ecosystems to climate warming with respect to forest plant 
communities. 

Several possible factors may explain the lower recovery of forest plant 
communities in response to climate warming in lowland versus high- 
land areas. First, the larger proportion of cosmopolitan and thermo- 
philous species (Supplementary Methods) in lowland communities 
(75.2% and 15.3% on average) compared to highland communities 
(67.2% and 8.8% on average) may result in a greater tolerance of low- 
land communities to climate warming throughout local persistence’. 
We did not find any significant changes in the composition of lowland 
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b). Bottom, absence of climate-flora equilibrium recovery, corresponding to 
plant communities composed mainly of cold-demanding species, reflecting 
temperature conditions before climate warming (To), and leading to an 
important temperature lag between FrT and CrT. Middle, partial recovery of 
the climate-flora equilibrium, corresponding to reshuffled plant communities, 
leading to a mixed assemblage of cold- and heat-demanding species and to 
reduced temperature lag between FrT and CrT. Top, complete recovery of the 
climate-flora equilibrium, corresponding to important reshuffling of the plant 
community, leading to an assemblage of heat-demanding species reflecting the 
effective temperature increase (T, — To) and to the absence of temperature lag 
between FrT and CrT. The small circles represent virtual plant species; the three 
large green disks each depict a community of plants. 


plant communities between 1965-86 and 1987-2008 (Fig. 3), whereas 
the proportion of mountainous and alpine species in highland plant 
communities decreased significantly (— 4.6% on average), to the benefit 
of cosmopolitan species (+5.1% on average). In addition, we suggest 
that compensatory changes in demographic rates may buffer popu- 
lation dynamics against the negative effects of climate warming, 
causing a temporary delay in species turnover, especially in lowland 
plant communities (for example, a higher growth of individuals can 
compensate for the lower survival and recruitment rates, allowing 
persistence of threatened populations)’*. Second, highland forests offer 
shorter-distance escapes for species facing climate warming compared 
to lowland forests*”, especially for forest plants that have strongly 
limited dispersal abilities’. Thus, species distributions are expected 
to migrate +1.1km (median value; 5th to 95th percentile range 
(PRs,.95) = [0.4; 15]) upward in French highland forests, but 
+35.6km (median value; PRs.95 = [2.1; 137.6]) northward in French 
lowland forests to track their climatic niches according to the con- 
temporary climate warming event (1987-2008; Supplementary 
Fig. 9). Third, the migration rate of plant species may be delayed by 
the level of habitat fragmentation*”*. Forest patches exceeding 5 km? 
cover a larger proportion of highland areas (67.3%) than lowland areas 
(29.1%; Supplementary Fig. 10a). Moreover, forest patches are closer in 
highland areas (median proximity index (PI) = 3,813.7; PRs.95 = [15.3; 
122,034.3]; see Supplementary Methods for details) than in lowland 
areas (median PI= 31.6; PRs.95=[1.3; 14,644.2]; Supplementary 
Fig. 10b). All of these factors are not mutually exclusive and may, 


Table 1 | Comparison of floristically and climatically reconstructed temperatures. 


a Lowland b Highland 
Reconstruction Period Reconstruction Period 

1965-1986 (P;) 1987-2008 (P2) P2— Py 1965-1986 (P;) 1987-2008 (P2) Po — Py 
CrT 10.54 (1.2) 11.66 (1.23) 1.11 (0.02)** CrT 7.98 (1.8) 9.05 (1.8) 1.07 (0.03)** 
Frit 10.35 (1.23) 10.37 (1.22) 0.02 (0.02) Frt 8.09 (1.77) 8.63 (1.61) 0.54 (0.03)** 
Fret — Crt —0.19 (0.87) 1.29 (0:99)** Fert —CrT 0.11 (1.12) 0.42 (1.23)* 


Shown are mean temperatures with standard deviations (in brackets) computed from the 1,000 floristically (FrT) and climatically (CrT) reconstructed temperature trends. P,, Value for 1965-86; Po, value for 
1987-2008. a, Comparison for French lowland forests (<500 m a.s.l.). b, Comparison for French highland forests (500-2,600 m a.s.l.). Significance of temperature differences between periods (Student's t test) 
and between modelling approaches (Student's paired t test) are indicated: *P<0.01, ** P<0.001 for more than 95% of the 1,000 reconstructed temperatures. 


2 | NATURE | VOL 000 | 00 MONTH 2011 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


a 43, By. 

12- 40- 
° 
ry 
a 

@ 114 94 
oO 
a 
€ 
© 

10- 85 

9 74 

1970 1980 1990 2000 2010 1970 1980 1990 2000 2010 
Year Year 


Figure 2 | Comparison of floristically (green) and climatically (red) 
reconstructed temperature trends between 1965 and 2008. a, Trends in 
lowland forest plant communities (<500 m a.s.l.). b, Trends in highland forest 
plant communities (500-2,600 m a.s.l.). The thickness of lines shows the range 


independently or combined, explain our results showing that re- 
shuffling of plant communities in lowland forests strongly lag behind 
climate warming. 

In this study, we linked reshuffling observed in plant assemblages in 
French forests to temperature changes between 1965 and 2008. 
However, water stress may also drive species range shifts''. In 
California, the core of the altitudinal range of 46 out of 64 forest plant 
species shifted downward in response to regional changes in climatic 
water balance between 1935 and 2005"’. In contrast, no significant 
changes in the level of annual precipitation was observed throughout 
the French territory between 1965-86 and 1987-2008 (mean value of 
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Figure 3 | Compositional changes in the plant communities of lowland and 
highland forests according to four different biogeographic groups. Mean 
shifts in the proportions of plant communities are shown (data points) with 
standard deviations (error bars) estimated from 1,000 floristic samples used to 
reconstruct temperatures. The significance of changes from the null hypothesis 
of zero shift is displayed (*P < 0.05 for more than 95% of the floristic samples; 
Wilcoxon paired signed-rank test). The number of species analysed in lowland 
(ny) and highland (4) plant communities are displayed below the figure. Mo, 
mountainous to alpine species; SMo, sub-mountainous to lowland species; Cos, 
cosmopolitan species; ThM, thermophilous to Mediterranean species. See 
Supplementary Methods and Supplementary Table 1 for more details about the 
different biogeographic groups. 


of reconstructed temperature trends (n = 1,000 trends). Dashed lines indicate 
the start of the contemporary climate warming period (1987-2008). Breaks in 
trends are due to no sample convergence for the years 1965 (in highland areas), 
1972 and 1974 (in both lowland and highland areas). 


precipitation changes [s.d.] = +1.04mm [48.56] in lowland areas, 
P= 0.35, Student’s t test for paired samples; — 0.93 mm [60.09] in high- 
land areas, P = 0.44, Student’s t test for paired samples; Supplemen- 
tary Fig. 2). Thus, the absence of changes in precipitation patterns over 
the 44-year study period is unlikely to explain the differences of 
reshuffling between lowland and highland forest plant communities. 
Similarly, acclimation or adaptation processes may also explain the 
differences we observed between floristically and climatically recon- 
structed temperatures. The large difference in temperature recovery 
between highland and lowland forest plant communities may indicate 
distinct interchanges between migration and adaptation, with migra- 
tion being more prevalent in highland forests and adaptation more 
prevalent in lowland forests. Nevertheless, such processes are probably 
insufficient to mitigate completely the effects of the high temperature 
increase in just 22 years, especially at the scale of a large pool of 
perennial species in a fragmented landscape*”’. 

Although a potential lag between the magnitude of climate warming 
and the extent of changes within biotic communities has already been 
suggested*'*’”-”, we provide the first (to our knowledge) assessment of 
such a lag and show that it is more important in lowland areas. This 
finding has important implications in terms of conservation issues. To 
date, a strong focus has been placed on mountaintop extinctions for 
species facing climate warming’. However, mountaintop extinctions 
are just one example, although important, of the likely consequences 
of contemporary species range shifts in highland areas. Highland 
areas offer short-distance escapes for species facing climate change*” 
and provide ideal refuges for plant species to track their climatic 
requirements’, which could lead to the formation of novel species 
assemblages. In lowland areas, the current inertia of forest plant 
communities suggests delayed extinction and immigration in the face 
of climate warming” due to species tolerance and low migration rate. 
Considering the velocity of predicted future climate change®, the 
low dispersal abilities of forest plants’? and the low opportunity for 
short-distance escape to track climate changes in lowland areas*’, the 
current inertia of forest plant communities will not be indefinite. 
Consequently, abrupt reshuffling of these plant communities may 
occur in the future, possibly leading to lowland biotic attrition’? with 
important changes expected in ecosystem functioning”’. Therefore, we 
recommend that research efforts be focused on understanding the 
implications of climate change in lowland areas. 


METHODS SUMMARY 


To achieve our aim, we compiled a complete species list including the year and 
location of 79,621 floristic surveys (44,509 and 35,112 in lowland and highland 
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areas, respectively) that were recorded across the French forest territory between 
1965 and 2008. We computed a 1-km” grid of the mean annual temperature across 
the French territory for each year between 1965 and 2008 using a modelling 
approach based on 237 non-interrupted meteorological stations (R* = 0.93 for 
13,620 independent temperature observations; Supplementary Figs 5 and 6)**. 
These climatically reconstructed temperatures were extracted at the location and 
year of each of the 79,621 floristic surveys. We then subsampled a training data set 
of 2,987 floristic surveys before the recent climate warming event (1975-85) to 
calibrate a transfer function using a combination of weighted averaging partial 
least squares” and Breiman’s random forest” to infer temperatures from the plant 
assemblages (R* = 0.83 in a validation data set; Supplementary Fig. 7). These 
floristically reconstructed temperatures were then predicted across the French 
territory between 1965 and 2008 using the 76,634 floristic surveys. This predictive 
data set was bootstrapped separately in lowland and highland areas for each year to 
control for artificial temperature variations associated with inter-annual variations 
in floristic survey locations (Supplementary Fig. 8). We reconstructed 1,000 pairs 
of floristically and climatically reconstructed mean annual temperature trends 
separately for lowland and highland areas. We tested temperature differences in 
each pair of trends between 1965-86 and 1987-2008, as well as annually. To 
discuss our results, we assessed changes in plant community composition in four 
biogeographic groups of species. We then computed the mean distance separating 
equal isotherms between periods, in addition to performing independent spatial 
analysis of the degree of habitat fragmentation in lowland and highland forest areas 
(see Supplementary Methods for more details). 
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Caspase-1 activation by inflammasome scaffolds comprised of 
intracellular nucleotide-binding oligomerization domain (NOD)- 
like receptors (NLRs) and the adaptor ASC is believed to be essential 
for production of the pro-inflammatory cytokines interleukin (IL)- 
1f and IL-18 during the innate immune response’. Here we show, 
with C57BL/6 Casp11 gene-targeted mice, that caspase-11 (also 
known as caspase-4)** is critical for caspase-1 activation and IL- 
1B production in macrophages infected with Escherichia coli, 
Citrobacter rodentium or Vibrio cholerae. Strain 129 mice, like 
Casp11~’~ mice, exhibited defects in IL-1B production and 
harboured a mutation in the Casp11 locus that attenuated 
caspase-11 expression. This finding is important because published 
targeting of the Casp1 gene was done using strain 129 embryonic 
stem cells’. Casp1 and Casp11 are too close in the genome to be 
segregated by recombination; consequently, the published Casp1~ 
mice lack both caspase-11 and caspase-1. Interestingly, Casp11” 
macrophages secreted IL-1B normally in response to ATP and 
monosodium urate, indicating that caspase-11 is engaged by a 
non-canonical inflammasome. CaspI™Casp11'?™°™ macro- 
phages expressing caspase-11 from a C57BL/6 bacterial artificial 
chromosome transgene failed to secrete IL-1f regardless of 
stimulus, confirming an essential role for caspase-1 in IL-1B pro- 
duction. Caspase-11 rather than caspase-1, however, was required 
for non-canonical inflammasome-triggered macrophage cell death, 
indicating that caspase-11 orchestrates both caspase-1-dependent 
and -independent outputs. Caspase-1 activation by non-canonical 
stimuli required NLRP3 and ASC, but caspase-11 processing 
and cell death did not, implying that there is a distinct activator 
of caspase-11. Lastly, loss of caspase-11 rather than caspase-1 
protected mice from a lethal dose of lipopolysaccharide. These data 
highlight a unique pro-inflammatory role for caspase-11 in the 
innate immune response to clinically significant bacterial 
infections. 

Many bacterial toxins promote inflammasome activation’*°, and 
this is also true for cholera toxin B (CTB), a component of the AB5 
holotoxin complex (Fig. 1a and Supplementary Fig. 1). CTB lacks the 
enzymatic activity necessary for entero-pathogenesis, but binds to 
GM1 ganglioside on the cell surface to facilitate the entry of enzymatic 
component cholera toxin A (CTA)’®. Similar to ATP, which is a 
known activator of caspase-1 (ref. 17), CTB induced NLRP3- and 
ASC-dependent processing and secretion of caspase-1 and IL-1 from 
lipopolysaccharide (LPS)-primed bone-marrow-derived C57BL/6 
macrophages (BMDMs). Several other NLR family members 
(NLRC4, NLRP6, NLRP12, NOD1 and NOD2), inflammasome com- 
ponent AIM2 (refs 18-20), the ATP-releasing pannexin-1 channel, 
and the ATP-gated P2RX7 receptor’! were dispensable for CTB- 
induced IL-1 secretion (Supplementary Fig. 1a). These data indicate 
that CTB does not activate the NLRP3 inflammasome indirectly by 
causing ATP release and P2RX7 receptor stimulation. To investigate 
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other host factors involved in the inflammasome response to CTB, we 
determined whether macrophages from other mouse strains secrete 
IL-1f when exposed to CTB. Polymorphisms in the Nirp1b gene, for 
example, alter sensitivity to anthrax lethal toxin’. LPS-primed 12986 
BMDMs cultured with ATP processed caspase-1 and secreted IL-1, 
but they were unresponsive to CTB (Fig. 1b, c). Western blotting for 
pro-inflammatory adaptor proteins and caspases indicated that 129S6 
macrophages were deficient in pro-caspase-11 isoforms’ p43 and p38 
(Fig. 1d). Quantitative polymerase chain reaction with reverse tran- 
scription (RT-PCR) confirmed attenuated LPS-induced Casp11 
messenger RNA expression in 12986 BMDMs (Fig. le). Caspase-11 
is similar to caspase-1 (46%), orthologous to human caspase-4 and -5, 
and has been shown to interact with caspase-1 upon overexpression®®. 
Although caspase-11 was shown to be essential for LPS-induced IL-1 
secretion in vivo’, the role of caspase-11 in macrophage inflammasome 
signalling and its mechanism of activation remain unclear. 

Primers that amplified the full-length Casp11 mRNA expressed in 
C57BL/6 BMDMs after culture with either LPS or E. coli recovered 
only a truncated Casp11 complementary DNA (A110) from similarly 
treated 129S6 cells (Fig. 1f). Sequencing revealed that the 12986 Casp11 
transcript lacked all sequence encoded by exon 7. Splicing of exon 6 to 
exon 8 creates a frame-shift after proline 304 and a stop codon 
occurred after 5 aberrant amino acids (Fig. 1g). Absence of exon 7 
expression was confirmed in 129S6 BMDMs by quantitative RT-PCR 
for Casp11 exon 7 (Supplementary Fig. 2a). Macrophages from three 
additional 129 substrains (129X1, 129S1 and 129P3) also lacked 
detectable caspase-11 protein, expressed Casp11 A110 mRNA, and 
produced negligible IL-1B in response to CTB (Supplementary Fig. 
2b-d). Sequencing of 129S1 genomic DNA identified a 5-bp deletion 
encompassing the exon 7 splice acceptor junction as the origin of the 
A110 isoform (Fig. 1h). The caspase-11 antibody used in our experi- 
ments detected the truncated caspase-11 protein encoded by Casp11 
A110 when it was overexpressed in 293T cells (Supplemen- 
tary Fig. 2e) but failed to detect endogenous caspase-11 A110 product 
in 129 BMDMs (Fig. 1d and Supplementary Fig. 2b), possibly due to 
nonsense-mediated mRNA decay (Fig. le and Supplementary Fig. 2a). 
Caspase-11 A110 protein product lacks most of the small catalytic 
subunit so any protein that is expressed should be non-functional 
(Supplementary Fig. 2f). 

We explored further the role of caspase-11 in inflammasome activa- 
tion by targeting Casp11 exon 5 for deletion in C57BL/6 embryonic 
stem (ES) cells. This exon encodes the critical catalytic residue cysteine 
254 (ref. 7) and most of the caspase-11 large catalytic subunit (Sup- 
plementary Fig. 3). LPS-primed C57BL/6 Casp11*’* (wild type) 
and Casp11" BMDMs were stimulated with ATP to engage the 
NLRP3-dependent inflammasome, poly(dA:dT) double-stranded 
DNA or Francisella tularensis to activate the AIM2-dependent inflam- 
masome, and flagellin or Pseudomonas aeruginosa to engage the 
NLRC4-dependent inflammasome’**’. These canonical stimuli 


Department of Physiological Chemistry, Genentech Inc., South San Francisco, California 94080, USA. Department of Molecular Biology, Genentech Inc., South San Francisco, California 94080, USA. 
3Department of Medical Protein Research, VIB, B-9000 Ghent, Belgium. “Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium. °Department of Bioinformatics, Genentech Inc., South San 
Francisco, California 94080, USA. ®Department of Immunology, Genentech Inc., South San Francisco, California 94080, USA. 


00 MONTH 2011]! VOL 000 | NATURE | 1 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


© © 
Woop o 
NDHANDND 
SSS kEE 
OXGAGA 
Sup wm | p20 caspase-1 
ewtedee [17 1L-18 
oe ee ee oe Pro-caspase-1 
Coll ext | eam! Pro-iL-1 


a 8,000 b 5,000 
0 C57BL/6 
= 6,000 ENirp3~ = 
€ Nirc4-/- = 
2 4,000} BAim2” 3 
= a 
Cont ATP | CTB | 
d e 15 C57BL/6 
CS57BL/6 1298 Casp11 @ 12986 
sesees a : 
OjWOnW 8 5 
oo =? + P43 | Pro- 2 
oe + P38 Icaspase-11 > 
S 15,000 
ow ew ee ee ee Pro-caspase-1 x 1b 
=~ @ ewenten | ASC z 10,000 7 
se eee) Pro-il-1p E 
Cell ext 5,000 
Cont LPS E.coli 


Figure 1 | Germline mutation of Casp11 in mouse strain 129 abolishes 
inflammasome activation by CTB. a, b, IL-1 secretion by LPS-primed 
BMDMs after culture in medium alone (Cont), 5mM ATP or 20 pg ml 'CTB 
for 16h. WT, wild type. c, LPS-primed BMDMs stimulated for 8 h. IL-18 and 
the caspase-1 p20 subunit were immunoblotted in cell supernatants (Sup), pro- 
caspase-1 and pro-IL-1f in cell extracts (Cell ext). d, BMDM extracts after 6 h. 


induced comparable IL-1f secretion from wild-type and Casp117~ 
BMDMs (Fig. 2a). Like Casp11 mutant 129 BMDMs, however, 
Casp11’~ BMDMs failed to secrete IL-1 in response to CTB 
(Fig. 2a). IL-1 secretion in response to other bacterial toxins, includ- 
ing adenylcyclase (AC) toxin’, listeriolysin O (LLO) toxin’’ or 
Clostridium difficile toxin B"’, was not affected by caspase-11 deficiency. 
Caspase-11 also was dispensable for NLRP3-dependent IL-1 secretion 
in response to monosodium urate (MSU), calcium pyrophosphate 
(CPPD)*”? or the ionophore nigericin'’’ (Fig. 2b). In contrast, 
NLRP3- and ASC-dependent IL-1 secretion from BMDMs infected 
with live E. coli, C. rodentium and V. cholerae”, required caspase-11, 
with or without LPS priming (Fig. 2b and Supplementary Fig. 4a-c). 
Hereafter, we refer to this caspase-11-dependent inflammasome as the 
non-canonical inflammasome, and all stimulations are performed with 
LPS-primed BMDMs. The non-canonical inflammasome that we 
describe seems to be distinct from the human caspase-5/NLRP1 com- 
plex reported previously” because Casp11'7°™"!?°™ BMDMs exposed 
to anthrax lethal toxin secreted IL-1f normally” (data not shown). 
Caspase-1 is thought to cleave pro-IL-1B and pro-IL-18 into bio- 
logically active IL-1 p17 and IL-18 p18, respectively>’"°*””. Secretion 
of processed caspase-1 subunits p20 and p10, IL-18 and IL-18 in 
response to CTB or E. coli required caspase-11, ASC and NLRP3 
(Fig. 2c, d). Wild-type BMDMs cleaved pro-caspase-11 and produced 
the autocatalytically cleaved caspase-11 p26 subunit®’ (Supplementary 
Fig. 4d) in response to CTB or E. coli (Fig. 2e). ATP also stimulated 
formation of the caspase-11 p26 subunit (Fig. 2e) but some time after 
caspase-1 processing peaked (data not shown). Because caspase-11 
was dispensable for ATP-induced processing and secretion of 
caspase-1, IL-1 and IL-18 (Fig. 2a, c, d and Supplementary Fig. 4f), 
we believe that this caspase-11 processing after ATP treatment reflects 
the promiscuity of caspase-1 (ref. 28) in our in vitro assay conditions 
and does not represent a physiological function of caspase-11. 
Consistent with a role for caspase-11 in non-canonical caspase-1 pro- 
cessing, endogenous pro-caspase-11 p43 and its p26 cleavage product 
co-immunoprecipitated with endogenous caspase-1 from BMDMs 
treated with CTB or E. coli (Supplementary Fig. 4e). A much weaker 
interaction occurred between pro-caspase-1 and pro-caspase-11 after 
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e, Casp11 and Il1b mRNA expression in BMDMsafter 6 h. f, Full-length Casp11 
transcripts recovered from BMDMs in e. g, 129 Casp11 A110 isoform splices 
exon 6 (black) to exon 8 (red). h, 129S1 and C57BL/6 Casp11 genes. Exon 7 
sequence is uppercase and the splice acceptor sequence is boxed. Graphs show 
the mean = s.d. of triplicate wells and are representative of three independent 
experiments. 


canonical ATP treatment. Unexpectedly, CTB and E. coli both induced 
secretion of the caspase-11 p26 subunit in the absence of NLRP3 or 
ASC (Fig. 2e). The mechanism of caspase-11 activation remains 
unknown. 

Canonical inflammasome activation by ATP causes NLRP3- and 
ASC-dependent macrophage death and releases pro-inflammatory IL- 
1a and high-mobility group box 1 protein (HMGB1), which both lack 
a leader sequence and a typical caspase cleavage site’ (Fig. 2d, e). 
Caspase-11 was not required for this death because wild-type and 
Casp11’~ BMDMs stimulated with either ATP or C. difficile toxin B 
released comparable amounts of HMGB1 (Fig. 2e), IL-1 and an indi- 
cator of cell membrane damage, lactate dehydrogenase (LDH; Fig. 2d). 
In contrast, caspase-11 deficiency, rather than NLRP3 or ASC loss, 
prevented HMGBI1, IL-1«and LDH release triggered by non-canonical 
activators CTB, E. coli, C. rodentium and V. cholerae (Fig. 2d, e). 
Collectively, our data indicate that caspase-11 is an essential effector 
of non-canonical caspase-1 activation and IL-1B/IL-18 secretion 
together with NLRP3 and ASC, but signals death independently of 
NLRP3 and ASC (Fig. 2f). 

Next we assessed non-canonical inflammasome activation in 
Casp1' ~ mice’®, but this strain and an independent knockout strain’ 
were generated from 129 ES cells harbouring the Casp11 mutation 
described in Fig. 1. Casp1 and Casp11 are adjacent in the mouse genome 
and separated by ~1,500 base pairs. Consequently, backcrossing to 
C57BL/6 or NOD/ShiLt] mice over multiple generations is highly 
unlikely to segregate the two mutant genes. Indeed, CaspI~ 
BMDMs expressed the Casp11 A110 isoform and lacked detectable 
caspase-11 protein after stimulation with diverse Toll-like receptor 
agonists regardless of their genetic background’ (Fig. 3a, b and 
Supplementary Fig. 5a, b). Hereafter, we therefore refer to these 
Casp1/Casp11'°™?°™ mice as Casp1/11 double-knockout mice. 

To obtain mice lacking only caspase-1, we microinjected a Casp11 
bacterial artificial chromosome (BAC) transgene (Tg) into Casp1/11 
double-knockout embryos. Casp1 exons 1 and 2 within the BAC were 
deleted to prevent any caspase-1 expression. Western blotting iden- 
tified a Casp1’’ Casp11'® line that exhibited inducible caspase-11 
expression (Fig. 3c), and expression of Casp11 exon 7 was confirmed 
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Figure 2 | Caspase-11 mediates non-canonical inflammasome activation by 
CTB, E. coli, C. rodentium and V. cholerae. a, b, IL-1 secretion by LPS- 
primed BMDMs stimulated as indicated for 16h. Cont, medium alone. 

c, e, Immunoblots of IL-1B, IL-18, cleaved caspase-1 subunits (c), caspase-11 
p26 and HMGBI (e) released from LPS-primed BMDMs after stimulation for 


by quantitative RT-PCR (Supplementary Fig. 6a). Casp1~Casp11"® 
or Casp1/11 double-knockout BMDMs, unlike their wild-type counter- 
parts, both failed to process and secrete IL-1B and IL-18 in response to 
ATP, C. difficile toxin B, CTB or E. coli (Fig. 3d, e and Supplementary 
Fig. 6b, c). These data confirm the essential role of caspase-1 in IL-1B 
and IL-18 production. Both caspase-1 and caspase-11 were required for 
IL-1 and IL-18 secretion triggered by non-canonical stimuli, and 
although recombinant caspase-11 cleaved caspase-1, it processed IL- 
18 poorly in the absence of caspase-1 (refs 6, 7; data not shown). 
Therefore, pro-IL-1f probably is a direct substrate of caspase-1 rather 
than caspase-11 in the non-canonical inflammasome pathway. 

Next we determined the contribution of caspase-1 to macrophage 
death induced by non-canonical activators CTB and E. coli (Fig. 3f). 
Casp11*' and Casp1Casp11"® showed similar LDH release to wild- 
type BMDMs, whereas Casp1/11 double-knockout and Casp11~ 
BMDMs showed enhanced survival. The Casp11 transgene also 
restored IL-1 (Fig. 3f) and HMGBI release (Fig. 3g) from Casp1/11 
double-knockout BMDMs, with caspase-11 p26 being released as in 
wild-type BMDMs (Fig. 3g). These results indicate that caspase-1 is 
dispensable for caspase-11 activation and macrophage death induced 
by the non-canonical inflammasome (Fig. 2f). In contrast, macrophage 
death plus the release of IL-1a and HMGB1 induced by canonical 
activators ATP and C. difficile toxin B required caspase-1 but not 
caspase-11 (Fig. 3f, g). The molecular mechanisms distinguishing 


8h. d, IL-18, LDH and IL-1o released from LPS-primed BMDMS after 
stimulation for 16h (ATP, for 8h). f, Model for canonical and non-canonical 
inflammasome signalling. Graphs show the mean = s.d. of triplicate wells and 
are representative of three independent experiments. 


caspase-1 activation in the canonical versus non-canonical pathways 
remains to be determined. 

Weextended our findings with cultured macrophages by challenging 
wild type, Casp11 Mae Casp 11 ~, Casp1/11 double knockout and 
Casp1 Casp11 5 with a lethal dose of LPS (54mg kg~'), which is a 
model of acute septic shock (Fig. 4a and Supplementary Table 1). 
Consistent with previous reports®!°, Casp1I~” and Casp1/11 double- 
knockout mice were resistant to LPS challenge; whereas six of six 
Casp11~ mice were alive at 40h after injection, six of six wild-type 
and Casp1 1° mice had succumbed within 18h. Notably, no 
CaspICasp11"® mice survived beyond 26h. We conclude that 
caspase-11 rather than caspase-1 is the dominant effector of LPS- 
induced lethal septic shock. Previous studies implicating both 
caspase-11 and caspase-1 in this disease model*!° may have been 
misled by the assumption that published Casp1~~ mice expressed 
caspase-11 normally. 

Casp11', Casp1/11 double-knockout and Casp1~/Casp11"® mice 
all had less serum IL-1 and IL-18 at 12 h after LPS injection than wild- 
type or Casp11*’~ mice, indicating that both caspase-1 and caspase-11 
were necessary for IL-1B and IL-18 secretion in vivo*!°°’ (Fig. 4b). It 
should be noted that IL-1B and IL-18 are dispensable for LPS-induced 
lethality because mice lacking both IL-1 and IL-18 are as susceptible 
as wild-type mice*’. We speculate that caspase-11-mediated tissue 
damage, which probably does not require caspase-1, is responsible 
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Figure 3 | Caspase-1 and caspase-11 have stimulus-specific roles during 
inflammasome activation. a—c, BMDMs stimulated as indicated for 6 h were 
immunoblotted (a, c) or Casp11 transcripts were recovered by RT-PCR 

(b). Cont, medium alone. dKO, double knockout. d, IL-1f secretion by LPS- 
primed BMDMs stimulated for 16 h. e, LPS-primed BMDMs stimulated for 8 h. 
IL-1f and IL-18 were immunoblotted in cell supernatants (Sup), pro-IL-1B and 


Figure 4 | Caspase-11 rather than caspase-1 is required for LPS-induced 
lethality. a, Survival of mice (n = 6 per genotype) injected intraperitoneally 
(i.p.) with 54mgkg' LPS. Data are representative of two independent 
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experiments. Adjusted P values are supplied in Supplementary Table 1. 
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pro-IL-18 in cell extracts (Cell ext). f, LDH and IL-1 released from LPS- 
primed BMDMs after stimulation for 16h (ATP, for 8h). g, LPS-primed 
BMDMs stimulated for 8 h. Caspase-11 p26 and HMGB1 were immunoblotted 
in cell supernatants, pro-caspase-11 and HMGB1 in cell extracts. Graphs show 
the mean + s.d. of triplicate wells and are representative of three independent 


OWT 

© Casp11*/- 

@ Casp11-- 

@ Casp1/11 dKO 

© Casp1~-Casp11"9 


Cont LPS i.p. 


b, Serum IL-1, IL-18 and IL-1o at 12h after injection of 20mgkg | LPS. 
Circles represent individual mice. Error bars represent the mean + s.e.m. Data 
are representative of two independent experiments. 


for the lethal septic shock. Intriguingly, IL-1o released from damaged 
cells into the serum was equivalent in wild-type, Casp11*/~ and 
Casp1’Casp11'® mice, whereas very little IL-1 was detected in 
Casp11’ or Casp1/11 double-knockout mice (Fig. 4b). Mice lacking 
the IL-1 receptor are as susceptible as wild-type mice to LPS-induced 
lethal sepsis (data not shown), so it is not clear what causes the lethality 
downstream of caspase-11. Like Casp1~Casp11"§ mice, Nirp3" and 
Asc’ mice survived only slightly longer than wild-type mice following 
high dose LPS (Supplementary Fig. 7a), despite failing to make serum 
IL-1f and IL-18 (Supplementary Fig. 7b). This result confirms pre- 
vious studies showing that NLRP3 and ASC are essential for LPS- 
induced caspase-1 activation in vivo'’”’, and it supports the notion 
that the caspase-1-independent non-canonical pathway elicits lethal 
septic shock. The NLRP3-ASC-caspase-1 axis may amplify the shock 
response, however, as Nirp3~ or Asc” mice were relatively resistant 
to lower doses of LPS’””®. 

Pro-inflammatory caspase-11 triggers caspase-1-independent 
macrophage death and caspase-1-dependent IL-1 and IL-18 produc- 
tion in response to a subset of inflammasome activators that we refer to 
as non-canonical activators. Our in vivo data indicate, contrary to 
current thinking, that caspase-11 rather than caspase-1 may be the 
critical effector of deleterious inflammatory responses. Therefore, 
targeting human caspase-4 and caspase-5 may be more effective than 
caspase-1 inhibition in patients with sepsis. Our findings also highlight 
the need to revisit the role of caspase-1 versus caspase-11 in different 
mouse disease models, as so far all studies have used Casp 1/11 double- 
knockout mice. 


METHODS SUMMARY 
Mice. Casp11 exon 5 in C57BL/6 ES cells was flanked by loxP sites and then deleted 
with Cre recombinase. Casp1~Casp11'® mice were created by introducing a 
Casp11 BAC transgene into Casp1/11 double-knockout embryos. 
BMDM culture. BMDMs were primed with 500 ng ml LPS (E. coli 0111:B4) for 
Sh and then stimulated in OPTI-MEM with 500ng ml * LPS or infected with 
bacteria. IL-1, IL-1 and IL-18 secretion was measured by ELISA. For immuno- 
blotting, cells were lysed with RIPA buffer. Proteins in culture supernatants were 
precipitated with 7.2% trichloroacetic acid plus 0.15% sodium cholate. 
LPS-primed BMDMs were infected with P. aeruginosa (multiplicity of infection 
(m.0.i.) 25), E. coli (m.o.i. 20), C. rodentium (m.o.i. 20), V. cholerae (m.o.i. 50) or 
F. tularensis ssp. novidica strain U112 (m.o.i. 40) for 1.5h and then cultured in 
100 jg ml! gentamycin. Other stimulations included ATP (5 mM), MSU (100 pg 
ml '), CPPD (200 uugml'), CTB (20 ugml'), AC toxin (5 ugml'), C. difficile 
toxin B (0.2 pg ml ') and LLO toxin (10 Lg ml '). 
Endotoxic shock model. Mice (8-10 weeks old) were injected intraperitoneally 
with 54mgkg ! LPS (E. coli 0111:B4) and monitored 8 times daily for a total of 
6 days. For serum cytokine measurements, a separate cohort of mice received 
20 mgkg~' LPS and blood was collected 12h later. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. Nirp3-, Nirca", Nirp12’ ~ Asc’, Aim2”, P2rx7~ and PanxI~~ mice 
were described previously*'*” and were backcrossed to C57BL/6 for at least 10 
generations. Seven-to-ten-week-old male or female mice were used for experi- 
ments. Other mice included 129X1/SvJ (129X1), 129S1/SvImJ (12981), 129P3/J 
(129P3), BALB/c, SJL, DBA/1 and NOD/ShiLt) Casp1/11 double knockout 
(Jackson Laboratory), 129S6/SvEvTac (12986, Taconic), Nirp6 ~~, Nod1~ and 
NodI~“-Nod2’- double knockout (Thirumala-Devi Kanneganti, St. Jude 
Children’s Research Hospital). C57BL/6 Casp1/11 double-knockout (Casp1”~ 
Casp11'™”1?°™") mice were obtained by backcrossing NOD/ShiLt] Casp1/11 
double knockout to C57BL/6 for more than 10 generations. The Genentech animal 
care and use committee approved all mouse studies. Casp11~‘~ mice were genotyped 
with PCR primers (5’TGAAATGCATGTACTGAGAGCAAGG; 5'CAATTGA 
CTTGGGGATTCTGG and 5'GTCAGAGATGAAAGACTTTGCTGC) yielding 
a 475-bp wild-type DNA fragment (a 704-bp fragment is possible too) and a 337-bp 
mutant DNA fragment. 

Reagents. MSU, CPPD, flagellin (S. thyphimurium) and ultra-pure LPS (E. coli 
0111:B4) were from Invivogen. Other reagents included CTB, AC toxin, C. difficile 
toxin B, LLO toxin (List Biological Laboratories), Nigericin, poly(dA:dT) and ATP 
(Sigma). 

BMDM culture. Bone marrow cells were differentiated in DMEM with 10% 
endotoxin-free fetal bovine serum (Omega Scientific) and 20% M-CSF-conditioned 
medium for 5-6 days, then plated at ~1.0 X 10° cells ml! and cultured overnight. 
BMDMs were primed with 500 ng ml’ LPS for 5h and then cultured in OPTI- 
MEM media (Invitrogen) containing 500 ng ml" LPS or bacteria (P. aeruginosa, 
ATCC 10145; E. coli, ATCC 11775; C. rodentium, ATCC 51116; V. cholera, ATCC 
9459; F. tularensis ssp. novidica strain U112 from D. Monack, Stanford University). 
IL-1B (Meso), IL-1a (BD Bioscience) and IL-18 (MBL) were measured by ELISA. 
Cytotoxicity was measured by CytoTox 96 Non-Radioactive Cytotoxicity Assay 
(Promega). 

Immunoblotting. Caspase-1 was immunoblotted with rat anti-mouse caspase-1 
p20 (clone 4B4, Genentech) or rabbit anti-caspase-1 p10 (sc-514, Santa Cruz). A 
rabbit polyclonal antibody (GeneTex) detected IL-1. Other antibodies included 
rat anti-caspase-11 (clone 17D9, Novus Biologicals), rabbit anti-IL-18 (Biovision), 
rat anti-ASC (clone 8E4.1, Genentech), rabbit antic HMGB1 (GeneTex) and rabbit 
anti-B-actin (Novus Biologicals). 

RT-PCR. BMDM total RNA was prepared with an RNeasy kit (QIAGEN). The 
entire Casp11 coding region was amplified with a SuperScript One-Step RT-PCR 
for Long Templates kit (Invitrogen). Primers were: 5'ATGGCTGAAAAC 
AAACACCCT and 5’TCAGTTGCCAGGAAAGAGGTAG. Primer and probe 
sets used for TaqMan (Applied Biosystems) included Il1b 171 (sense 5'GAG 
TGTGGATCCCAAGCAAT, anti-sense 5’TACCAGTTGGGGAACTCTGC, 
probe 5’FAM-TGGAAAAACGGTTTGTCTTCA-TAMRA), Casp11 174 (sense 
5’'ACAATGCTGAACGCAGTGAG, anti-sense 5'CTGGTTCCTCCATTTCCA 


GA, probe 5'FAM-CATTCTTCAGTGTGGACCCA-TAMRA), and Rpl19 (sense 
5'GCGCATCCTCATGGAGCACA, anti-sense GGTCAGCCAGGAGCTTCT 
TG, probe 5’FAM-CACAAGCTGAAGGCAGACAAGGCC C-TAMRA). Samples 
were normalized by quantification of Rpl19 mRNA. 

DNA sequencing. Casp11 cDNAs amplified from C57BL/6 and 129S6 BMDMs 
were cloned into pcDNA3.1+ (Invitrogen) and sequenced. The caspase-11 C254A 
mutant was created with a QuikChange site-directed mutagenesis kit (Stratagene). 
Casp11 genome sequence in 129$1 BAC clone (CT7-292K12; Invitrogen) was 
sequenced with primer 5'CATGTCTAACTATATTGAAATGTGAA. 

BAC transgenic mouse. A C57BL/6 mouse BAC clone (RP23-78A8, Invitrogen) 
containing 156 kb of the Casp1 and Casp11 genomic region was characterized by 
DNA fingerprinting and transformed into SW102 cells. This BAC contains 65 kb 
DNA upstream of Casp11 and 53 kb downstream of Casp11. To create a BAC 
transgene with functional Casp11 and inactivated Casp1, an Frt-PGK-em7-Neo- 
Frt cassette was synthesized (Blue Heron/Origene) and used to delete, using 
recombineering, a 1.4-kb region encompassing Casp1 exons 1 and 2 (NCBI37/ 
mm?9 assembly chr9: 5,298,191-5,299,550). Correctly targeted BAC DNA was 
transformed into Flp-expressing SW105 cells**** to remove the Neo selection 
marker. The modified region of the BAC was confirmed by DNA sequencing. 
Transgenic mice carrying the modified Casp1/Casp11 BAC were obtained using 
standard pronuclear microinjection methods* using Casp1/11 double-knockout 
embryos. Casp1~'"Casp11"® mice were genotyped with PCR primers (5’/ACAGA 
AGAGAGATCTGAGCCTTCA and 5’ACACAGACTTGGACCCTGTTAGT 
AG) yielding a 379 bp Casp1 1'§ DNA fragment. 

Endotoxic shock model. Mice (8-10 weeks old) were injected intraperitoneally 
with 54mgkg ’ LPS (E. coli 0111:B4; Sigma) and monitored 8 times daily for a 
total of 6 days. For serum cytokine measurements, a separate cohort of mice 
received 20 mg kg ' LPS and blood was collected 12 h later. Serum cytokines were 
measured by IL-1a ELISA (BD Bioscience), IL-1; ELISA (Meso) and IL-18 
luminex assay (Bio-Rad). 
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Chromatin modifiers regulate lifespan in several organisms, raising the question of whether changes in chromatin states 
in the parental generation could be incompletely reprogrammed in the next generation and thereby affect the lifespan of 
descendants. The histone H3 lysine 4 trimethylation (H3K4me3) complex, composed of ASH-2, WDR-5 and the histone 
methyltransferase SET-2, regulates Caenorhabditis elegans lifespan. Here we show that deficiencies in the H3K4me3 
chromatin modifiers ASH-2, WDR-5 or SET-2 in the parental generation extend the lifespan of descendants up until the 
third generation. The transgenerational inheritance of lifespan extension by members of the ASH-2 complex is 
dependent on the H3K4me3 demethylase RBR-2, and requires the presence of a functioning germline in the 
descendants. Transgenerational inheritance of lifespan is specific for the H3K4me3 methylation complex and is 
associated with epigenetic changes in gene expression. Thus, manipulation of specific chromatin modifiers only in 
parents can induce an epigenetic memory of longevity in descendants. 


Transgenerational epigenetic inheritance has been described for some 
traits, including flower symmetry and colour in plants’*, progeny 
production in worms*, heat stress response and eye colour in 
Drosophila*~’, and coat colour in mammals*'°. However, the trans- 
generational epigenetic inheritance of longevity, and more generally 
of complex traits, is largely undefined. Chromatin modifiers have 
been shown to regulate longevity in several species'’"'’, raising the 
possibility that chromatin changes in parents might not be entirely 
reset between generations and thereby also regulate longevity in 
descendants. Deficiencies in the H3K4me3 regulatory complex com- 
posed of ASH-2, WDR-5 and SET-2 extend lifespan in C. elegans". 
We asked if perturbation of members of the H3K4me3 regulatory 
complex (ASH-2, WDR-5 and SET-2) only in the parental generation 
could regulate the lifespan of descendants in subsequent generations 
in C. elegans. 


Transgenerational inheritance of longevity 


We first focused on WDR-5, a conserved regulatory component of the 
ASH-2 complex”” whose depletion decreases H3K4me3 levels’*?°-* 
and extends lifespan in worms’’. To test whether longevity could be 
inherited in a transgenerational epigenetic manner, we crossed 
wild-type (+/+) males with wdr-5(0k1417) mutant (wdr-5/wdr-5) 
hermaphrodites to generate Fl heterozygous hermaphrodites 
(Fig. 1a). These Fl heterozygous hermaphrodites were genotyped 
and then self-crossed to generate F2 hermaphrodites (wild type, 
heterozygous and homozygous at the wdr-5 locus), which were geno- 
typed after they had laid F3 generation progeny. In parallel, we 
crossed a wild-type male with a wild-type hermaphrodite to generate 
pure wild-type descendants and control for any beneficial longevity 
effects that could come from crossing rather than self-mating 
(Fig. 1a). Longevity of genetically wild-type descendents from wild- 
type or wdr-5 mutant ancestors was compared in the F3, F4 and F5 
generations. Interestingly, genetically wild-type F3 descendants from 
PO wdr-5 parents (+/+ from PO wdr-5 parents) still showed a ~20% 
extension of lifespan (P< 0.0001) compared to descendants from 


pure wild-type parents (+/+ from PO WT parents) (Fig. 1b). This 
20% lifespan extension was similar in magnitude to the lifespan exten- 
sion of pure F3 wdr-5(0k1417) mutants (wdr-5/wdr-5) (Fig. 1b). The 
lifespan of genetically wild-type descendants from wdr-5(0k1417) 
mutant parents (+/+ from PO wdr-5 parents) was still extended in 
the F4 generation (Fig. 1c), but was no longer extended in the F5 
generation (Fig. 1d). Thus, wdr-5 deficiency only in the parental 
generation can extend the lifespan of subsequent generations. 
Because the lifespan of F5 generation wild-type descendants from 
wdr-5 mutant parents is no longer extended, the lifespan extension 
observed in the F3 and F4 generations is unlikely to be due to extrane- 
ous mutations that might have been present in the parental wdr-5 
mutant strain. Instead, the transgenerational inheritance of longevity 
may be due to epigenetic changes in H3K4me3 itself or in another 
molecule that can only be inherited for a limited number of generations. 

We next asked if a transgenerational epigenetic heritability of life- 
span was also observed with SET-2, the H3K4me3 methyltransferase 
enzyme that functions together with ASH-2 and WDR-5 to regulate 
H3K4me3 levels'*”?* and longevity in C. elegans’? (Fig. 2). Similar to 
what we observed for wdr-5, genetically wild-type descendants from 
set-2(0k952) mutants still had a ~30% extension of lifespan 
(P< 0.0001) in the F3 and F4 generations (Fig. 2b, c), but not in 
the F5 generation (Fig. 2d). Genetically wild-type F3 descendants 
from the reverse cross—P0 set-2(0k952) males crossed with wild-type 
hermaphrodites—were also long-lived (Supplementary Table 1), 
indicating that transgenerational inheritance of longevity is not linked 
to a particular gender in the parental generation. 

ASH-2 is important for the conversion of H3K4 dimethylation 
(H3K4me2) to H3K4me3 (ref. 23). Knockdown of ash-2 by RNA 
interference (RNAi) in worms decreases global H3K4me3 levels at 
the L3 stage’*”’ and extends longevity’*. We asked if ash-2 knockdown 
only in the parental generation affected the lifespan of several 
generations of descendants. Wild-type parent worms (P0) were 
placed on plates with bacteria expressing RNAi to ash-2 from birth 
to the larval stage L4, then switched every day for 3 days onto plates 
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Figure 1 | Genetically wild-type descendants from wdr-5 mutant parents 
have extended lifespan for several generations. a, Scheme for generating 
wild-type (+/+) descendants from wdr-5(0k1417) mutant worms (wdr-5/wdr- 
5). b-d, Lifespan of genetically wild-type F3 (b), F4 (c) and F5 (d) descendants 


containing OP50-1 bacteria and streptomycin to selectively prevent 
the growth of RNAi-expressing bacteria (Fig. 3a). Endogenous ash-2 
messenger RNA and ASH-2 protein levels were significantly 
decreased in the PO generation, but returned to normal levels in 
subsequent generations (Fig. 3b, c), indicating that ash-2 RNAi is 
not itself inherited. The lifespan of worms from the Fl, F2 and F3 
generations in which ash-2 had been knocked down only in the PO 
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Figure 2 | Genetically wild-type descendants from set-2 mutant parents 
have extended lifespan for several generations. a, Scheme for generating 
wild-type (+/+) descendants from set-2(0k952) mutant worms (set-2/set-2). 
b-d, Lifespan of genetically wild-type F3 (b), F4 (c) and F5 (d) descendants 
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of wdr-5(0k1417) mutant worms (+/+ from PO wdr-5 parents) compared to 
descendants of wild-type worms (+/+ from PO WT parents). Mean lifespan 
and statistics are presented in Supplementary Table 1. 


parental generation was still significantly extended (19-27%, 
P<0.0001) compared to that of descendants of worms treated with 
empty vector control in the PO parental generation (Fig. 3d-g). By 
contrast, F4 generation descendants no longer had extended lifespan 
(Fig. 3h). We obtained similar results after bleaching PO worms to 
avoid potential carry over of RNAi-expressing bacteria (data 
not shown). Thus, alteration of the components of the H3K4me3 
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from set-2(0k952) mutant worms (+/+ from PO set-2 parents) compared to 
descendants of wild-type worms (+/+ from P0 WT parents). Mean lifespan 
and statistics are presented in Supplementary Table 1. 
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Figure 3 | Knockdown of ash-2 only in the parental generation extends 
lifespan for several generations. a, Scheme for generating wild-type 
descendants from RNAi-treated parents. b, ash-2 mRNA levels at day 7 in 
different generations of worms treated with ash-2 RNAi or empty vector 
(control) only in the PO generation. Mean + s.e.m. of three independent 
experiments. ***P = 0.0002 with paired t-test. c, ASH-2 protein levels at L3 
stage in different generations of worms treated with ash-2 RNAi or empty 


methyltransferase complex (ASH-2, WDR-5 and SET-2) in parents 
affects the lifespan of descendants, supporting the possibility that 
transgenerational inheritance of longevity is due to epigenetic changes 
that may only be inherited for a limited number of generations. 


Importance of the H3K4me3 demethylase and germline 


The H3K4me3 demethylase RBR-2 is necessary for the lifespan exten- 
sion caused by deficiencies in members of the ASH-2 complex”. We 
asked if the transgenerational extension of longevity induced by defi- 
ciencies in members of the ASH-2 complex is dependent on RBR-2. 
The lifespan of genetically wild-type F3 descendants from PO wdr-5 
parents (+/+ from PO wdr-5 parents) was no longer extended in the 
presence of rbr-2 RNAi (Fig. 4a, b). Similarly, F3 wild-type descendants 
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Age (days) 
vector (control) only in the PO generation. Representative of two independent 
experiments. *, non-specific band; arrow, ASH-2; arrowhead, protein related to 
ASH-2, possibly a degradation product. d-h, Lifespan of PO (d), F1 (e), F2 
(f), F3 (g) and F4 (h) generations of worms with RNAi knockdown of ash-2 or 
control RNAi (empty vector) in parents only. Mean lifespan and statistics are 
presented in Supplementary Table 2. 


from set-2;rbr-2 parents (+/+ from PO set-2;rbr-2 parents) were no 
longer long-lived (Supplementary Fig. 1). Together, these data indicate 
that the transgenerational inheritance of longevity due to deficiencies in 
H3K4 trimethylation complex members is dependent on the H3K4me3 
demethylase RBR-2. The fact that the longevity of wild-type descen- 
dants of wdr-5 and set-2 mutants is reverted by deficiencies in rbr-2 also 
indicates that this extended lifespan is unlikely to result from extrane- 
ous mutations in wdr-5 or set-2 strains. rbr-2 mutation or knockdown 
did not lead to a shortening of lifespan in descendants (Supplementary 
Fig. 2), indicating that by itself, RBR-2 deficiency does not affect 
longevity in a transgenerational manner. 

Longevity due to modulation of the ASH-2 complex is dependent on 
a functioning germline’’. To test if wild-type descendants of worms 
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Figure 4 | Transgenerational inheritance of longevity by deficiencies in 
ASH-2 complex members is dependent on the presence of the H3K4me3 
demethylase RBR-2 and an intact germline. a, b, Lifespan of genetically wild- 
type F3 descendants from wdr-5(0k1417) mutant worms (+/+ from PO wdr-5 
parents) in the presence of empty vector (control RNAi) (a) or rbr-2 RNAi 


with deficiencies in ASH-2 complex members also require the presence 
of a functioning germline for lifespan extension, we used temperature- 
sensitive feminized fem-3(e2006) mutant worms, which do not produce 
mature eggs at the restrictive temperature**. Knockdown of ash-2 and 
wdr-5 only in parents extended the lifespan of the F1 generation in fem- 
3(e2006) mutant worms at the permissive temperature (16 °C), but not 
at the restrictive temperature (25 °C) (Supplementary Fig. 3). To inde- 
pendently examine if the germline is required for the longevity of 


aio 31.2%, P< 0.0001 
0.84 
oO 
2 
= 98) [Po 
% 9.44 — Control RNAi 
gy r 
ic — set-15 RNAi 
0.24 — set-9 RNAi 
— utx-1 RNAi 
04 — war-5 RNAi 
0 5 10 15 20 25 30 35 
Age (days) 


c 

PO sas) — +/+6) 

hg 6 wasteg +/dat-2 3 
El aa 


+G +idat-2$ daf-2/daf-2 
ng cut chat 2g 


+/+’ —— daf-2/daf-2 g 


Fa 


Figure 5 | Other longevity regulators do not have a transgenerational effect 
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worms treated with set-9, set-15, utx-1 and wdr-5 RNAior control RNAi (empty 
vector) only in the PO generation. c, Scheme for generating wild-type (+/+) 
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(b). ¢, d, Lifespan of pgl-1 F3 descendants from wdr-5(0k1417);pgl-1(bn101) 
mutant worms (pgl-1/pgl-1 from PO wdr-5;pgl-1 parents) compared with 
descendants from pgl-1(bn101) worms at the permissive temperature (16 °C) 
(c) and at the restrictive temperature (25 °C) (d). Mean lifespan and statistics 
are presented in Supplementary Tables 3 and 4. 


wild-type descendants of mutants of ASH-2 complex members, we used 
pgl-1(bn101) temperature-sensitive mutants that cannot form a func- 
tioning germline at the restrictive temperature” (Fig. 4c, d). F3 genera- 
tion pgl-1 descendants from wdr-5;pgi-1 mutant parents no longer had 
an extended lifespan compared to pgl-1 descendants from pgl-1 parents 
at the restrictive temperature (25 °C) (Fig. 4c, d). Thus, a functioning 
adult germline is necessary for the long lifespan of wild-type descen- 
dants of parents with deficiencies in members of the ASH-2 complex. 
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progeny from daf-2(e1370) mutant worms (daf-2/daf-2). d, Lifespan of 
genetically wild-type F3 descendants from daf-2(e1370) mutant worms (+/+ 
from PO daf-2 parents). Mean lifespan and statistics are presented in 
Supplementary Tables 2 and 4. 
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Specificity of epigenetic memory of lifespan 

We then asked if the transgenerational inheritance of longevity is 
specific to H3K4me3 modifiers or if it is also observed with chromatin 
modifiers of other marks (set-9, set-15 and utx-1), and more generally 
with genes in known longevity pathways: insulin signalling (age-1 
and dod-23), mitochondria (cco-1 and cyc-1) and stress resistance 
(asm-3)!*"'7"186-32, Tn contrast to what we observed for ash-2 and 
wdr-5, knockdown of set-9, set-15, utx-1, age-1, asm-3, cco-1, cyc-1 
and dod-23 only in parents did not extend the lifespan of the F1 
generation (Fig. 5a, b and Supplementary Fig. 5). Similarly, genetically 
wild-type F3 descendants from long-lived daf-2(e1370)°? mutant 
worms (+/+ from PO daf-2 parents) had no significant extension 
of lifespan (6% P= 0.1955) (Fig. 5c, d). Collectively, these findings 
indicate that transgenerational extension of longevity is relatively 
specific to H3K4me3 chromatin modifiers, and further indicate that 
the H3K4me3 mark may be important for epigenetic memory of life- 
span between generations. As SET-9, SET-15 and UTX-1, unlike 
members of the ASH-2 complex, regulate lifespan in a manner that 
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Figure 6 | Genetically wild-type descendants from wdr-5 mutant parents 
exhibit differences in gene expression, but not in global H3K4me3 levels, 
compared to descendants from wild-type parents. a, b, Global H3K4me3 
levels in the F4 generation by western blot (a) or in the F3 generation by 
immunocytochemistry (b) of L3 worms from genetically wild-type descendants 
from wdr-5 parents (+/+ from wdr-5) or wild-type parents (+/+ from WT), 
and wdr-5 mutants (wdr-5). Scale bars, 50 um. c, Scheme for generating wild- 
type descendants from a cross between wdr-5(0k 1417) null mutant worms and 
wild-type worms. Symbols represent RNA samples from L3 worms from three 
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is independent of the germline , it is also possible that trans- 
generational inheritance of longevity is specific to chromatin regula- 
tors that act in the germline. 


12,17,18 


Transgenerational inheritance of gene expression 


We next determined if transgenerational inheritance of lifespan is 
associated with heritable changes in H3K4me3. Western blot and 
immunocytochemistry showed that global H3K4me3 levels were 
not decreased in F3 and F4 generation genetically wild-type descen- 
dants from wdr-5 and set-2 parents or in F1 and F2 generation des- 
cendants from ash-2 or wdr-5 knockdown only in parents (Fig. 6a, b 
and Supplementary Figs 6, 7). Thus, transgenerational inheritance of 
lifespan is unlikely to be mediated by a heritable global decrease in 
H3K4me3 levels. Transgenerational inheritance of lifespan might be 
associated with heritable local changes of H3K4me3 at certain loci, 
which could affect expression of certain genes involved in longevity. 
To test this idea, we compared gene expression genome-wide in wild- 
type descendants from wdr-5 mutant and wild-type ancestors, and 
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symbols) day of egg-laying. d, Unbiased hierarchical clustering of WDR-5 
regulated genes from the first day of egg-laying (Supplementary Table 9). 
Pvclust values are displayed on each node of the dendrogram. Values superior 
to 95 are considered significant. e, Principal component analysis (PCA) of the 
entire microarray data sets from the first day of egg-laying (Supplementary 
Table 5). PC, principal component. Symbols represent gene expression data 
from L3 worms collected on the first day of egg-laying (Fig. 6c). 
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pure wdr-5 mutant descendants in the F4 and F5 generations (Fig. 6c). 
For each condition, we collected triplicates of L3 stage worms from the 
first or second day of egg-laying (Fig. 6c), with the first day of egg-laying 
corresponding to the samples used for lifespan assays. Statistical analysis 
of microarray (SAM) identified 759 genes that were differentially regu- 
lated in wdr-5 pure mutants compared to wild-type worms, regardless of 
the generation (Supplementary Table 7) and egg-laying day (Sup- 
plementary Fig. 8a) (P = 2.38 x 10 ''®, hypergeometric probability). 
These WDR-5-regulated genes are enriched for longevity, develop- 
ment and growth gene ontology terms (Supplementary Fig. 8b), 
consistent with WDR-5’s reported functions’*’'”. As expected, 
WDER-5-regulated genes significantly overlap with ASH-2-regulated 
genes’? (P= 6.14 10 '”, hypergeometric probability, Supplemen- 
tary Fig. 8c) and are enriched for H3K4me3 (refs 34, 35) 
(P = 2.49 X 10 *4, hypergeometric probability, Supplementary Fig. 8d). 
These observations indicate that WDR-5 functions together with ASH- 
2 to regulate a subset of genes by modulating H3K4me3 at these loci. 

We asked if the expression of some WDR-5-regulated genes might 
be transgenerationally inherited. Interestingly, a significant subset of 
WDER-5-regulated genes was still differentially regulated in wild-type 
descendants from wdr-5 mutant worms in the F4 generation (Fig. 6d 
and Supplementary Fig. 10a), but not in the F5 generation (Fig. 6d and 
Supplementary Fig. 10a), consistent with the return to a normal life- 
span in the F5 generation. Unbiased hierarchical clustering analysis 
showed that WDR-5-regulated genes in wild-type descendants from 
wdr-5 mutant versus wild-type parents still clustered separately in the 
F4, but not the F5 generation (Fig. 6d and Supplementary Fig. 10a). 
Principal component analysis (PCA) confirmed that overall gene 
expression in wild-type descendants from wdr-5 parents versus 
wild-type parents is easily distinguishable in the F4, but not the F5 
generation (Fig. 6e and Supplementary Fig. 10b). Genes with trans- 
generational inheritance of expression were slightly more enriched for 
H3K4me3 than expected by chance (P = 0.0123 and P = 0.0769 for 
the first and second day of egg-laying, respectively, hypergeometric 
probability), and may represent the genes that are the most affected by 
the loss of the H3K4me3 mark. A number of these genes are known 
longevity regulators and are expressed in the germline (Supplemen- 
tary Table 7). Gene ontology analysis of genes with transgenerational 
inheritance of expression shows enrichment for different types of 
metabolic pathways (Supplementary Figs 9, 10c), raising the possibility 
that changes in metabolism may play a role in the heritability of the 
phenotype. The genes with transgenerational inheritance of expression 
were different on the first versus second day of egg-laying, and were no 
longer identified when samples from different days of egg-laying were 
pooled (E.L.G. and A.B., data not shown). This could be because worms 
produced on the first day of egg-laying might be more susceptible to 
H3K4me3 depletion, because each collection day may represent a dif- 
ferent snapshot in the rapidly changing L3 stage’, or because of inherent 
stochasticity in the transgenerational inheritance of gene expression. 
Overall, these results indicate that ancestral H3K4me3 status influences 
the gene expression of descendants for several generations. 


Discussion 


Our study provides the first example of epigenetic inheritance of 
longevity. Histone methylation marks and DNA methylation are 
generally, but not always, erased between generations with epigenetic 
reprogramming””**. Our observations are consistent with the notion 
that H3K4me3 at specific loci may not be completely erased and 
replenished. Alternatively, the ASH-2/WDR-5/SET-2 complex could 
control the expression of the genes responsible for the erasure and 
replenishment of histone methylation marks between generations. 
Modulation of H3K4me3 modifiers in parents may also affect an 
unidentified protein or RNA that could in turn be inherited and cause 
lifespan changes. Interestingly, H3K4me3 regulators have been sug- 
gested to have a role in the inheritance of eye colour in Drosophila** 
and of active transcriptional states in Dictyostelium”. As the ASH-2 
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H3K4me3 regulatory complex is conserved from yeast to humans, 
manipulations of this complex in parents might have a heritable effect 
on longevity in mammals. 


METHODS SUMMARY 


Genetic crosses. For genetic crosses, wild-type or pgl-1 male siblings were crossed 
to hermaphrodite siblings (to generate wild-type descendants) or to hermaphrodite 
mutants (to generate wild-type and mutant descendants). F1 hermaphrodites were 
allowed to lay progeny and were then genotyped by single-worm genotyping to 
ensure that they were heterozygous at the loci of the mutations of interest. Twenty 
F2 worms were placed on individual plates and allowed to lay ~40 eggs. F2 parents 
were then genotyped by single-worm genotyping and the lifespan of three inde- 
pendent lines was analysed. Progeny from each of these independent lines were 
collected after synchronized egg-laying for the F4 and F5 generations and analysed 
in lifespan assays. For microarray analysis, RNA samples were isolated at the F4 and 
F5 generations at the larval stage L3. Each sample was prepared from independent 
F2 clonal parents in triplicate with ~1,000 worms each. 

Lifespan assays. Worm lifespan assays were performed at 20°C, unless noted 
differently, without 5-fluoro-2'-deoxyuridine (FUdR), as described previously”. 
For each lifespan assay, 90 worms per condition were used in three plates (30 
worms per plate), unless noted differently. Worms that underwent ‘matricide’, 
that exhibited ruptured vulva, or crawled off the plates were censored. Statistical 
analyses of lifespan were performed on Kaplan-Meier survival curves in Stat View 
5.0.01 by log rank (Mantel-Cox) tests. The values from the Kaplan-Meier curves 
are included in the Supplementary Tables. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Worm strains. wdr-5(0k1417) and set-2(0k952) strains were provided by the 
Caenorhabditis Genetics Center. Wild-type (N2), wdr-5(0k1417) and _ set- 
2(ok952) strains were genotyped for mut-16(mg461), a mutation that affects 
RNAiefficiency and that was found as an extraneous mutation in several laboratory 
strains. These strains did not contain the mut-16(mg461) mutation. wdr-5(0k1417) 
mutant worms were backcrossed four to nine times by crossing wild-type males 
with wdr-5(0k1417) hermaphrodites. set-2(0k952) mutant worms were backcrossed 
four to six times by crossing wild-type males with set-2(0k952) hermaphrodites. The 
transgenerational inheritance of longevity was similar, both in terms of magnitude 
and number of generation, whether wdr-5(0k1417) and set-2(0k952) worms were 
backcrossed four to nine times or four to six times, respectively (see Supplementary 
Table 1), arguing against a simple backcrossing effect to explain the increased 
lifespan of wild-type descendants from wdr-5 or set-2 parents. rbr-2(tm1231) 
mutant worms were backcrossed seven times, daf-2(e1370) mutant worms were 
backcrossed an additional two times by our lab, set-2(0k952) and rbr-2(tm1231) 
were backcrossed two times and six times, respectively, before being crossed 
together to generate set-2(0k952);rbr-2(tm1231) and then crossed an additional 
time to wild-type worms; wdr-5(0k1417) and pgl-1(bn101ts) were backcrossed four 
times each before being crossed together to generate wdr-5(0k1417);pgl-1(bn101ts) 
and then crossed an additional time to pgi-1(bn101ts) worms. For crosses involving 
set-2(0k952);rbr-2(tm1231) mutant worms, six F3 progeny were genotyped for each 
independent line to ensure the genotype of the second mutant loci. Temperature- 
sensitive fem-3(e2006) mutant worms were either maintained at 16°C or were 
switched to 25°C at birth and maintained at this temperature for the entirety of 
their lifespan. Temperature-sensitive pgl-1(bn101) and wdr-5(ok1417);pgl- 
1(bn101) mutant worms were either maintained at 16°C or were switched to 
25 °C at the L4 stage in F2 parents. F3 progeny from these worms was maintained 
at 16°C or 25 °C for the entirety of their lifespan. 

RNA interference. Adult worms were placed on NGM plates containing ampicillin 
(100 mg ml ') and IPTG (0.4mM) seeded with the respective bacteria and 
removed after 4-6 h to obtain synchronized populations of worms. HT115 (DE3) 
bacteria transformed with vectors expressing RNAi to the genes of interest were all 
obtained from the Ahringer library (a gift from M. W. Tan), except RNAi to rbr-2 
that was from the Open Biosystems library (a gift from K. Shen). At the L4 stage, PO 
worms were moved to NGM plates containing streptomycin (300 jig ml” ') seeded 
with OP50-1 bacteria, which are streptomycin-resistant, to eliminate any poten- 
tially remaining RNAi HT115 (DE3) bacteria, which are streptomycin-sensitive. PO 
worms were switched to fresh OP50-1 seeded plates every day until day 6 of life (day 
2 of adulthood). Day 6 PO worms were allowed to lay eggs for 4-6 h and progeny 
from that stage were picked from these plates and their lifespan was examined. 
Subsequent generations were obtained by placing young adult F1, F2, F3, or F4 
worms on fresh OP50-1 seeded plates for 4-6 h. To perform RNAi in fem-3(e2006) 
mutant worms, one set of PO worms was maintained at 16 °C to allow them to lay 
eggs for the F1 generation, while a second set of PO worms was analysed in lifespan 
assays at both 16 °C and 25 °C. RNAi to rbr-2 was initiated at the eggs or L1 stage of 
F3 generation wdr-5(0k1417) mutant worms, wild-type descendants of wdr- 
5(0k1417) mutant worms, and wild-type descendants of wild-type parents. 
Real-time quantitative reverse transcription polymerase chain reaction (qRT- 
PCR). Two hundred worms were picked to NGM plates with OP50-1 bacteria 
overnight 2 days in a row. Worms were then picked to NGM plates without 
bacteria and washed three times with M9 buffer (22mM KH,PO,, 34mM 
K,HPO,, 86mM NaCl, 1mM MgSO,). Worm pellets were resuspended in 
TRIzol (Invitrogen), followed by six freeze-thaw cycles in liquid N. One pg of 
total RNA was reverse-transcribed with oligo dT primers using Superscript II 
reverse transcriptase (Invitrogen) according to the manufacturer’s protocol. Real 
time PCR was performed on a Bio-Rad iCycler or Roche LightCycler 480II using 
iQ SYBR green (Bio-Rad) or LightCycler480 SYBR green I master (Roche) with 
the following primers: pan-actin F: 5’-TCGGTATGGGACAGAAGGAC:3’, 
pan-actin R: 5'-CATCCCAGTTGGTGACGATA-3’, ash-2 F: 5'-CGATCGAAA 
CACGGAACGA-3’, ash-2 R: 5'-TGCCGGAATCTGCAGTTTTT-3’, wdr-5 F: 
5'-CCCTGAAACAATACACTGGACACG-3', wdr-5 R: 5'-AACTGGATGAC 
AATCGGAGGC-3’. The experiments were conducted in duplicate and the 
results were expressed as 2((target gene number of cycles - pan-actin number of cycles) 
Protein analysis by western blot. Worms were synchronously grown to the L3 
stage and washed off plates with M9 buffer (22 mM KH»PO,, 34mM K3HPO,, 
86 mM NaCl, 1 mM MgSO,). Worms were washed several times in M9 buffer and 
snap-frozen in liquid N>. Sample buffer (2.36% SDS, 9.43% glycerol, 5% 
B-mercaptoethanol, 0.0945 M Tris HCl pH6.8, 0.001% bromophenol blue) was 
added to worm pellets and they were repeatedly snap-frozen in liquid No. Worm 
extracts were sonicated three times for 30 s at ~15 W (VirSonic 600) and boiled 
for 2 min before being resolved on SDS-PAGE (10% or 14%) and transferred 
to nitrocellulose membranes. The membranes were incubated with primary 


antibodies (H3K4me3 (Abcam ab8580, Millipore 07-473), 1:500; H3 (Abcam 
ab1791), 1:1,000; ASH-2 antibody” (a gift from B. J. Meyer), 1:2,000, alpha- 
tubulin (Sigma T9026), 1:1,000), and the primary antibodies were visualized 
using horseradish-peroxidase-conjugated anti-rabbit secondary antibody 
(Calbiochem 401393) and ECL Plus (Amersham Biosciences). 

Whole-mount immunocytochemistry. Worms were washed several times to 
remove bacteria and resuspended in fixing solution (160 mM KCl, 100 mM Tris- 
HCl pH7.4, 40mM NaCl, 20mM Na,EGTA, 1mM EDTA, 10mM spermidine 
HCl, 30mM PIPES pH 7.4, 1% Triton X-100, 50% methanol, 2% formaldehyde) 
and subjected to two rounds of snap freezing in liquid N3. The worms were fixed at 
4°C for 30 min and washed briefly in T buffer (100 mM Tris HCl pH 7.4, 1 mM 
EDTA, 1% Triton X-100) before a 1 h incubation in T buffer supplemented with 1% 
B-mercaptoethanol at 37 °C. The worms were washed with borate buffer (25 mM 
H3BO3, 12.5mM NaOH pH 9.5) and then incubated in borate buffer containing 
10mM DTT for 15 min. Worms were blocked in PBST (PBS pH 7.4, 0.5% Triton 
X-100, 1 mM EDTA) containing 1% BSA for 30 min and incubated overnight with 
H3K4me3 antibody (Millipore 07-473; 1:100) and with Alexa Fluor 594 secondary 
antibody (Invitrogen; 1:25-1:100). DAPI (2 mg ml © ') was added to visualize nuclei. 
The worms were mounted on a microscope slide and visualized using a Leica SP2 
confocal system or a Zeiss Axioskop2 plus fluorescence microscope. 
Single-worm genotyping. Single worms were placed in 5 pl of worm lysis buffer 
(50 mM KCl, 10 mM Tris pH 8.3, 2.5 mM MgCl, 0.45% NP40, 0.45% Tween-20, 
0.01% gelatin (w/v) and 60 mg ml | proteinase K), and incubated at -80 °C for 
1h, 60 °C for 1h, and then 95 °C for 15 min. PCR reactions were performed using 
the following primers: set-2 F: 5'-TGAAAGGATGATACTCGTGGGC-3’, set-2 
R: 5'-CGATGAGAGAAAGGGGATTTTGTAAC-3’, wdr-5 F: 5'-TTGTGTGT 
TCGCTGTGCATG-3’, wdr-5 R: 5'-GTATTTGCTCTCGGTCGATC-3’, mut-16 
F; 5'-AATATTCGATCGGCAAGCAG-3’, mut-16 R: 5'-CCCGCCGATACAG 
AAACTAA-3', rbr-2 F: 5'-CAAGTGTCGTGTGATGCTGTGG-3’, rbr-2 R: 
5'-TGGCGATTGGAAACTCCGAG-3’, pgl-1 F: 5'-TGATGTGATTGCCGAG 
GAACAC-3', pgl-1 R; 5'-GCTGAAGAAGACTGAAGACGCTAAG-3’, daf-2 
F: 5'-ACCTGGAGTCGCTCAAGTTTTG-3’, daf-2 R: 5'-TGCTTCGCTTTCAT 
CGGTGTC-3’. 

PCR reactions were performed according to the manufacturer’s protocol 
(Qiagen) and PCR reactions were resolved on agarose gels. daf-2 PCR products 
were digested with BlpI to distinguish between wild-type and daf-2(e1370) geno- 
types. pgl-1 PCR products were digested with Msel to distinguish between wild- 
type and pgl-1(bn101) genotypes. 

Microarray analysis. Total RNA was isolated using an RNAqueous kit (Ambion). 
Microarray hybridization was performed at the Stanford Protein and Nucleic 
Acid facility with oligonucleotide arrays (Affymetrix, GeneChip C. elegans 
Genome Arrays). The raw unfiltered microarray results are deposited at the 
Gene Expression Omnibus (GEO) under the Subseries entry GSE31043. 
Background adjustment and normalization was performed with RMA (Robust 
Multiarray Analysis). Two-class unpaired analysis in significance analysis of 
microarrays (SAM) was performed with 100 permutations, a 10° seed for the 
random number generator and a 5% false discovery rate (FDR) to compare gene 
expression in wdr-5 mutants and wild-type descendants from wild-type parents. 
To obtain a 5% FDR, a 1.06 delta value was used for samples collected at the first 
day of egg-laying (day 1) and a 0.93 delta value for samples collected at the second 
day of egg-laying (day 2). Significantly changed probes from these two lists were 
then used to compare wild-type descendants from wild-type parents to wild-type 
descendants from wdr-5 parents in each generation using a 5% FDR. To obtain a 
5% FDR, a 0.66 delta value was used for the F4 generation at day 1, a 0.09 delta 
value was used for the F5 generation at day 1, a 0.92 delta value was used for the F4 
generation at day 2, and a 0.61 delta value was used for the F5 generation at day 2. 
Similar results for wild-type descendants from wdr-5 parents compared to wild- 
type descendants from wild-type parents were observed when SAM was per- 
formed with the entire normalized lists of genes. 

Hierarchical clustering. A complete linkage hierarchical clustering on the subset 
of WDR-5 regulated genes for each day (Supplementary Tables 9 and 10) was 
performed using Gene Cluster 3.0. Clustering results were analysed further with 
Java TreeView. Further statistical analysis was performed using Pvclust**. For the 
clustering analysis, genes and then arrays were centred using the mean. The R 
package Pvclust was used to apply complete linkage hierarchical clustering. As the 
data were centred, the uncentred Pearson correlation coefficient was used as a 
similarity measure, which was subsequently modified to dissimilarity by sub- 
tracting from 1. Experiments were conducted with 1,000 bootstrap replications. 
Principal component analysis. Principal component analysis (PCA)** was con- 
ducted on the entire normalized lists of genes (Supplementary Tables 5 and 6). 
The data were scaled to obtain unit variance before conducting the PCA analysis. 
The Prcomp function in the R package ‘Stats’ was used. The first and the second 
principal components (PC1 and PC2) were plotted. 
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H3K4me3 chromatin immunoprecipitation (ChIP)-chip data set from 
modENCODE and comparison between data sets. The H3K4me3 ChIP-chip 
data set was generated by the modENCODE consortium from worms at the L3 
stage***. The data, protocols, and antibody information can be accessed at the 
modENCODE Data Coordination Center (http://intermine.modencode.org), 
accession ID 3550. Use of this data set during the publication moratorium period 
was approved (S. Strome, personal communication). The raw unfiltered ChIP- 
chip data are deposited at GEO under the Subseries entry GSE30789. H3K4me3 
ChIP intensity signals were divided by Input signals, log transformed, centred to 
mean zero, and scaled to standard deviation one. H3K4me3 enrichment peaks 
(4493) were called using the program ChIPOTIe (ref. 45, http://sourceforge.net/ 
projects/chipotle-2/) with a P-value cut-off of 10 *°, window size 500 bp, step size 
100 bp, and the Bonferroni P-value correction. A list of gene coordinates (tran- 
script start-end) was obtained from WormBase WS170 (http://www.wormbase. 
org/). Peaks were mapped to 5,062 genes by identifying the genes that had peaks 
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overlap with their 5’ region (500 bp upstream and downstream from the tran- 
script start site). For comparisons between different data sets, hypergeometric 
probabilities were calculated using http://stattrek.com/Tables/Hypergeometric. 
aspx. 
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Yangling Mu, Gong Chen, Diana Yu, Shane McCarthy, 
Jonathan Sebat & Fred H. Gage 


Nature 473, 221-225 (2011) 


In this Letter, the scale bars were calculated incorrectly for each image 
throughout the manuscript and 60 X objectives were reported as 63. 
Although all the images are correct as published, the length repre- 
sented by every scale bar was wrong. The figure legends for Fig. 1 (the 
three scale bars are now all 100 pm and ‘630’ should be ‘x 600’) and 
Fig. 2 (the scale bar is now 80 |1m) have been amended in the PDF and 
HTML versions online. Similar errors in Supplementary Figs 2-5 and 
7-10 have also been corrected online. We very much regret these 
errors. These changes do not alter any of the conclusions of this Letter. 
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J. L. Sarmiento, N. Gruber, M. A. Brzezinski & J. P. Dunne 


Nature 427, 56-60 (2004) 


In this Letter, an error in the legend of Fig. 2a was drawn to our 
attention by G. de Souza (Eidgendssische Technische Hochschule 
(ETH), Zurich). The correction of Si* for denitrification was 
erroneously described as “We correct for this by using N* (ref. 30), 
to define a corrected Si* = Si(OH), — NO3_ — 6(N*) where 0 is set to 
one if N* is smaller than —2 jumol kg’ * and to zero otherwise.” The 
actual correction used in producing Fig. 2a was done as follows: “We 
correct for this by using N* = NO; — 16PQ,, to define a corrected 
Si* = Si(OH), — NO, + 6(N*) where 6 is set to one if N* is smaller 
than —3 umolkg’ * and to zero otherwise.” This has been corrected in 
the HTML version online. 
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